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© A synthetic RNA catalyst capable of cleaving an RNA substrate/: the "catalyst comprising a substrate binding 
portion, and a "hairpin" portion. The invention also provides an engineered DNA molecule and a vector, each 
comprising a DNA sequence coding for an RNA catalyst according to the invention. The invention further 
comprises host cells transformed with the vectors of the invention which are capable :pf expressing the RNA. 
substrate. Finally, the invention provides a method of cjeaving ah RNA substrate which comprises contacting the ; 
substrate with a synthetic RNA catalyst according to the invention; 
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RNA CATALYST FOR CLEAVING SPECIFIC RNA SEQUENCES 

This: application ; is a continuation-in-part of application Serial Na 07/247,1 00 filed September 20, 1988, 
the disclosure of which Is incorporated herein by reference, - . . 

s FIELD OF THE INVENTION 

The present invention relates to an RNA catalyst which cleaves -specific RNA. sequences into a fragment 
having a 5; hydroxyl and a fragment having a 2' ,3' cyclic phosphate. The products of the reaction 
described herein resemble those resulting from the natural hydrolysis of RNA. : : 

., BACKGROUND ; OF TH E INVENTION 

; Certain naturally : occurring sate 

rs cleavage. Self-cleavage has been demonstrated in vitro for avocado sunblotch viroid (ASBV) (Hutchins, C. ! = 
J., Rathjen, P. D„ Forster, A C. and Symons, R. H. (1986) Nucleic Acids Res. , 14: 3627-3640), satellite ■ 
RNA from tobacco ringspot virus (sTRSV) (Prody, G. A, Bakos, J. T., Buzayan, J. M. t Schneider, I R. and ■ 
Bruening, G. (1986) Science , 231 : 1577-1580; Buzayan, J. M., Gerlach, W. L. and Bruening, G. B. (1986) 
Proc. Natl, Acad. Sci. U.S.A. 83: 8859-8862) and lucerne transient streak virus (vLTSV) (Forster, A. C. and 

20 Symons, R. H. (1987) Cell, 49: 

211-220). These self-catalyzed RNA cleavage reactions share a requirement for divalent metal ions and 
neutral or higher pH and cleave target, RNA sequences to give 5' hydroxyl and 2\3'-cyclic phosphate 
termini (Prody, G. A, Bakos, J. T ; Buzayan, J. M„ Schneider, I. R. and Bruening, G. (1886); Science , 231: 
1577-1580; Forster, A. C.; arid Symons,; R V H. (1987) Cell, 49: 21 1-220; Epstein, L. M. and Gall, J. G. (1987) 

25 ; Cell, 48: 535-543; Buzayan, J. M. : Gerlach, W, L, Bruening, G. B., keese, ; P. -and Gould, A. R. (1986) 
Virology, 151: 186-199), 

A "hammerhead" model has been proposed and: accurately describesihe catalytic center of ( + )sTRSV 
RNA, the ( + ) and (-) strands of ASBV and the ( + ) and (-) strands of vLTSV (Forster, A; C. and Symons, R 
H. (1987) Cell, 49: 211-220). The single: exception is (-)sTRSV RNA which does not fit the "hammerhead" 
30 model (Forster, A. C. and Symons, R. H. (1987) Cell,. 49: 

211-220; Buzayan, J. M., Gerlach, W. L and Bruening, G. (1986) Nature , 323: 349-352; Buzayan, J. M., 
Hampel, A. and Bruening, G. B. (1986) Nucleic . Acids Res, , 14: 9729-9743), and the structure of whose 
catalytic center was unknown prior to the present invention; It is therefore understandable that the primary 
scientific focus has been on studying the "hammerhead" consensus structure and, as regards sTRSV, on 
35 studying the ( + ) strand. 

Intermplecular cleavage of an RNA sub^ the "harnmerhead^ rhodel was 

first shown Ih^^V 328: : 596-600);; T^ recovered and ■ 

; reacted. with multiple ;RNA: molecules, ctem^^ . 

Catalytic RNAs jdesighed 

40 sequences by making appropriate 1 base changes in the catalytic - RNA to : maintain : necessary base pairing ; ; 
with the target sequences (Haseloff and Gerlach; Nature , 334, - 585 (1988)^ 

334 , 196 .(1988); Uhlenbeck, O. C. (1987) Nature , 328 : 596-600; Koizumi, M., Iwai, S. and Ohtsuka. E, (1988) 
FEBS Lett. . 228 : 228-230). This has allowed use of the catalytic RNA to cleave specific target sequences 
and indicates that catalytic RNAs designed according to the "hammerhead" model may possibly cleave 
45 specific substrate RNAs in vivo , (see Haseloff and Gerlach, Nature , 334. 585 (1988); Waibot and Bruening, 
Nature , 334 , 196 (1988); Uhlenbeck, O, C. (1987) Nature , 328: 596-600). 

However, catalytic RNAs such as these that were designed based on the "hammerhead" model have 
several limitations which restrict their use in vitro and may forestall their use in vivo- For example, the 
temperature optimum for the (turnover 
. so number) is - only 0.5/miri even at .55 * C (Uhlenbeck, O/ C. (1987) Nature, 328:596-600; Haseloff and Gerlach, 
Nature , 334 , 585 (1988)). In addition, the Km is 0.6uM (Uhlenbeck, OTc: (1987) Nature , 328:596-600) 
meaning that the" reaction requires high concentrations of substrate which makes it difficult, if not: 
impossible, for the catalytic RNA : to cleave low levels of target RNA substrate such as would be 
encountered in vivo. 

Cech et aT published application WO 88/04300 also reports the preparation and use of certain synthetic : 
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ribozymes that have several activities, including :endoribonuc!ease activity. The design of these ribdzymes 
is based on the properties of the Tetrahymena ribosomal: RNA :seif»splicing, reaction. A temperature 
optimum '.of 50°C is reported (pages 37 and .39) for the endoribonuclease actvity, and the Km and kcat 
reported for! this activity were 0.8uM and .0:i3/rninute, respectively (page 
5 In yiew of the above; ' there . is ;a need -for an RN^ 
preferably near physiologlcal .te^ 
/engineered to ciir specific target RNA substra^ 

structure disclosed hereinafter; it is an object of the present to provide such^ 

Other objects, and features 
70 ^accordingly comprises the: products. and methods hereir^fter described and t^ 
the invention being Indicated; In the appended claims. '■■= 

SUMMARY OF THE INVENTION 

15 

The invention comprises a synthetic RNA catalyst capable of cleaving an RNA substrate which contains 
the sequencer 
5 -Fi -CS-F2-3 , 
wherein; - 

20 CS Is a cleavage sequence; and 

Fi and F2 each is a sequence of bases flanking the cleavage sequence. 

The: catalyst comprises a substrate binding portion and a "hairpin" portion. The substrate binding 

portion of the catalyst has the sequence: : 

3'-F*-l,-F 3 -5' 
25 wherein,;--' 

F3 is a sequence of bases selected so that F 3 is substantially base paired with F2 when the catalyst is 
bound to the substrate; 

F* is a sequence of bases selected so that F* is substantially base paired: with Fi when the catalyst is 
bound to the substrate; 

30 the sequences of F3 and F* being selected so that each contains an adequate number, of. bases to achieve 
sufficient binding of the RNA substrate to the RNA catalyst so that cleavage;of the substrate can take place;: : 

••and - •'• : ; . : : - : ; '•• - :: \ '.' 

Lt is a sequence of bases selected so that: Li does not base pair witfrCS when the catalyst is bound to the 
. substrate. ••' ' 

35 The : "hairpin" portion is a portion of the catalyst that assumes a hairpin-like : configuration when the 
substrate-catalyst complex is modeled in two dimensions for minimum energy folding. The "hairpin" portion 
of the catalyst preferably has the sequence: 

■" 45 " ■ ' .■ . ' ' : - .;• '•" " ' ; . ' ' ; : ' ' / . ; •. '• • ' ' . . .. / . 

•. wherein, . ■ ' • ••. ' 'v" - : • .• ::' : y : - /.:■:"■■" • . 

P 1 and P+ each is a sequence of bases, the sequences of Pi and Pv being selected so that Pi and P* are ... 
substantially base paired; 
Pi is cbvalehtly attached to F*; 
so Si and S2 each is a sequence of bases, the sequences df' Sr and S2 being selected so that Si and S2 are . 
substantially unpaired; //' : '.--'. 

:p 2 and P 3 each is a sequence of bases, the sequences of • P2 and P 3 . being selected so that P2 and P3 are 
' substantially base paired;: and : 

: . '. : L2 is a sequence of unpaired bases. \ : . : .'. ' : ' - 

: : 55 RNA catalysts according to the invention -can cleave substrates: of any 

: contain an appropriate -cleavage sequence, in particular; the catalysts can; l^- used to cleave a specific 
. -sequence :in : ;haturally-occur|hg RNA having • a cleavage sequence, as; wellVas : RNAs which have been 
engineered to contain a cleavage sequence. 
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The invention further comprises an engineered DNA molecule and a vector, each of which comprises a 
DNA sequence that codes for an RNA catalyst according to the invention. The invention also comprises a 
host transformed with the vector, the host being capable of expressing the RNA catalyst. In particular, hosts : 
can be transformed with vectors that; when transcribed, will produce RNA catalysts which can cleave any 
5 RNA, native or foreign, found In the host For example, hosts can be transformed with vectors that, when 
transcribed,; produce RNA' expression of genes: by cleaving! messenger v 
.RNA^ 

■ in vivo in prokaryotes and eukaryotes of plant or animal : i origin i^ 
' controlling viral infections. : • 
70 Finally; the invention; includes a method of cleaving .an RNA" substrate- compris^^^ 

substrate with an RNA catalyst according to the invention. The reaction is unique because it occurs under 
physiological conditions, having a temperature optimum hear 37*.C, with very favorable reaction param- 
eters. The method can be practiced in vitro or in vivo. For instance, the method may be practiced in vivo in 
host ceils that have been transformed with aTvector that codes for an RNA catalyst according to"~the 
75 invention. 



BRIEF DESCRIPTION OF THE DRAWINGS 

so , "Figure 1 shows the (-)sTRSV RNA substrate-catalyst complex: that fits the "hajrpin" model of catalytic 
RNA in accordance with the present invention. 

Figure 2 shows minimum energy folding of (-)sTRSV RNA. 

Figure 3 shows the time course of catalysis of a substrate RNA by the catalytic : RNA; 
Figure 4 shows the Michaelis-Menten kinetics of the RNA catalytic reaction. 
25 Figure 5 shows the temperature dependence of the RNA catalytic reaction; 

Figure 6 shows the dependence of the rate of reaction on concentration of catalytic RNA. 
Figure /.shows the reaction properties of a smaller RNA substrate, 

Figure 8 shows loss of catalytic activity; when the terminal A at position 175 or the terminal bases AU 
at positions 175 and 176 are removed from the catalytic RNA. 
30 Figure 9 shows loss of catalytic .activity when bases 195-203 in tHe catalytic RNA sequence are 

removed. . " . 

: Figure 10 shows loss of catalytic activity when bases AAA at pbsitibns 203, 202 and 201 are changed: 
; to CGU respectively, 

•: Figures 11A-C show that there Ha hq : eiffect : o n catalytic { activity when • base A at position 49 in the 
35 substrate is changed to a G, U or C. " 

Figure 12 shows that different target RNA sequences can be used as ; long as the base pairing with 
the catalytic RNA in the regions flanking the cleavage sequence is maintained. 

Figure 13 shows that an RNA sequence found in tobacco mosaic virus can be cleaved at a specific 
site with the catalytic RNA of the present invention. 
40 Figures 14A-C show three substrates having sequences found in the sequence of the messenger. 

RNA coding for chloramphenicol acetyl transferase. Figures 14A-C also show the separation patterns on 
acrylamide gels of the reaction products obtained by reacting these substrates with catalytic RNAs 
designed to base pair with the substrates in the regions flanking the AGUC cleavage sequence. 

Figure 15 shows the sequence of a substrate having a sequence found in the sequence coding for 
45: the: gag protein of the HIV-1 virus which; caused AIDS; ■ Figure: 1,5' also, shows the separation patterns on 
acrylamide gels of the reaction products; obtained by: reacting this substrate : with a catalytic RNA designed 
to base pair with the substrate in the regions flanking the CGUC cleavage sequence of the substrate. 

Figure 16 shows the sequence of a substrate having a sequence found in the: sequence: coding for . 
the regulatory tat protein of the Hi V-1 virus i Figure i 6 also shows the separation patterns on an acrylamide ■■ 
so gel of the reaction products obtained by reacting this substrate with a catalytic RNA designed to base pair 
with the substrate in the regions flanking the UGUC deavage sequence: of. the substratei, ^ 

+ ■ Figure 17 shows the -sequence of a substrate haying four.. non-native U's added : to the : 3 f end ;of i the : > 
sequence of the native (-)sTRSV substrate shown in. Figure 1 ■: Figured /' also shows the Reparation patterns : 
oh an. acrylamide gel of the reaction products obtained by reacting this - substrate with different cbncentra- i: 
55 tions of a catalytic RNA designed to base pair with the substrate in the regions fiahkihg the v cleavage : 
sequence of the substrate, including with the four non-native U's. . : 

Figure 18 summarizes: the; sequence requirements for the target region of the substrate • RNA. Only ; /. 
. • ' GUC : is required for cleavage: :' 



EP 0 360 257 A2 



Figure 19 shows the positions of base changes (open boxes) made in the sequence of the catalytic 
RNA shown inFlgure 1 in order to prove the existence of Helices 3 and 4 predicted by the "hairpin" model 
for (-)sTRSV RNA. Rgure 19 also shows the separation patterns pn aery lamide gels of the reaction products : • . 
obtained by reacting the various catalytic RNAs with substrate RNA S 17. : : : : ' ■ v 
s Rgure 20. shows the :; RNA sequence of an autocatalytic < cassette, that i has utility ;; in terminating :. 

transcription afa very specific site. Figur$:20: : W^ 

reaction products obtai ned when this catalyst was transcribejd ; and cleaved autocataiytica|ly . 
■ : - : . Rgure 21. shows the pbsrtions of two base chang^ : 
•: RNA sequencFshown in • Figure 1: Figure 21 al^ : shows ,the.;se of the ■ 

•70 reaction: products obtained by ^ reacting , these catalysts or cata!yst ; R51 (control gel) : with substrate SI 7. : 

Figure 22 A shows a partial map of plasmlij pHC-CAT containing the: CAT gene linked to the *hairpin " ; 
autocatalytic cassette of the invention. Also shown is the expected RNA transcript of the illustrated region 
and the expected 5 fragment of the autocatalytic cleavage. 

Figure 22B shows the result of Northern blot analysis of RNA produced by hoist cells transformed 
75 with pfTS^Af 

DETAILED DESCRIPTION OF THE DRAWINGS 

Figure The "hairpin" model for RNA catalysis was developed by determining the minimum energy 
20 folding predicted by computer modeling of the catalytic complex containing the .■ minimum size catalytic 
RNA and Vubstraite RNA of (-)sTRSV RNA. It is this minimum energy folding which is shown in Figure 1. 
Two mblecuies are shown folded: (1) catalytic RNA which contains 50 bases; of satellite RNA (224-175) and , 
(2) sub^at^ iRNA which contains 14 bases of satellite RNA (53-40). The arrow represents the site of 
cleavage.' . / . 

25 The 50-base catalytic RNA and the 1 4-base substrate RNA are the "minimum size in the sense that 
reductions in their length result in a substantial or total loss of catalytic activity as Is shown in the Examples 
below. Thus; this length of (-)sTRSV catalyst sequence is preferred to shorter lengths. Also, substrate RNA 
having at least the degree of base-pairing with the catalyst exhibited by the 14-^ is preferred. . 

Rgure 2. Minimum energy folding of ^ the Wisconsin 

30 RN/Tfoi^ng progra^^ (Zucker, M and ? Stiegier, P ) Nucleic Acids Resf 9: 1 33-148; Devereux, J„ : 
Haeberii ' P; and Smithies, O. (1 984) Nucleic Acids / Res- , 1 2: 387-395) with base numbers corresponding to 
( + )sTRSV (Buzayan, J. M., Gerlach,; W. L, Bruening, G. B., Keese, P. and Gould, A. R; (1 986) Virology, ■ 
i51: 186-199). With this numbering scheme' ^ 

number.:^ sequence is between bases 53-40 ,; 

35 and the catalytic RNA sequence is between bases 224-175. The arrow is the site of cleavage. 

The folding Identifies regions of expected base pairing and expected non-base pairing, loops. This 
model does not preclude higher order interactions occurring between the loops. 

Figure 3. Time course of substrate S17 cleavage by catalytic RNA R51. : The reaction was carried out 
under standard conditions, which were 37* C in 12mM MgCb, 40mM Tris pH 7.5 and 2mM spermidine, for 
40 the following times: lane 1. 30 sec; lane 2, 5 min; lane 3, 15 min; lane 4, 30 min; lane 5, 60 min; lane 6, 90 
min; lane 7 t 150 min. Concentrations were as follows: [R51] = 0.0032uM and [S17] = 0.09uM. RNA was 
separated on 7M urea, 20% acrylamide gels, bands cut out and counted in the liquid scintillation counter for 
. Rgures 3-1 7 and 1 9-21 . Throughout the figures, the designations s'f and 3' F are the products of cleavage 
•:• of the substrate and represent ttie resulting 5^fragments and 3: fragments, respectively. 
45 Rgure 4. Eadie^ ^ The : reaction was ;,: 

' - carried out ^t 37* C i 7.5' and 2mM; spe^jdine^ 

follows: ;[R5i ] == 0.0004uM and < [S17] = 0.1 25uM (lane 1), 0.0625uM; (lan^2)-: 0.0417uM 3), 0.031 uM; 
(lane 4), 0;021uM (lane 5), 0:0156uM (lane 6); 0.0078uM (lane 7) and 0.0039uM (lane 8). 

Rgure 5. Temperature dependence of the rate of cleavage of substrate RNA S17 by catalytic RNA R51. ; 
so The reaction was carried out in 12mM MgCl2, 40mM Tris pH 7.5 and 2mM, "spermidine; at 45°C (lane 6), : : 
41* C (lane 5), 37*C (lane 4), 33*C (lane 3), 27* C (lane 2) and 20 *C (lane 1). The concentrations used 
were: [R51] = 0.003uM and [S17]=0.09uM. R51 unlabeled. The velocities shown in the graph in Rgure. 5 
were calculated by the use of time points of 8 and 16 minutes. The; separation patterns of the reaction 
products on the. acryiamide gel shown in the figure are for the 1 6-minutei time point. ^ . ^ 
55 Figure 6. Rate of cleavage of substrate RNA S1 7 at varying concentrations: of. c^ R51 v Jhe > 

. '-. reaction was carried out at 37 * C in 12mM MgCI 2 ^40mM Tris pH 7.5 and 2mW "sperm 
: .. 1 and 2), 20 min (lane 3).; : 1Q min Oane 4) arid 5 min 
."•:•"' -0.175uM. • : ; ' ' '■'"''■ 
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Figure 7. Eadle Hofstee plot of catalytic RNA R51 cleavage of substrate RNA S10. the substrate S-10 ; ; 
containing The RNA sequence. GACAGUCCUG was prepared from a DNA tempjate containing ; the t-7 :. . 
promoter as described in Example 2. This substrate was. mixed with the catalytic RNA.RSI, from Example v !: 
2 under standard conditions: 37* C in12mM MgCk, 40 niM Tris pH 7.5 and 2mM. spermidine for. 10 min. - : 
5 Concentrations of substrate used were as follows: 0 1 15uM, 077uM, 0.03^(vi, 6.029uM/ 0.014uM, The : 
concentration of catalytic RNA,, R51, used was. 1nM. The line was fit' b/ linear regression analysis 
intercept, kcat, and Michaelis cpnstant, Km, calculated. 

Figure 8; Removal: of the termini catalytic RNA. The "A" at base position 

: 175 (circledj was removed and -{he: res ulting catalytic: RNA, ; R In addition; two : ■ 

iq : bases were • removed , to give \ R49 which had both A1 75 and U 176 removed: Concentration of substrate' Si 7 
: was • 0:3uM and all catalytic RN A concentrations were 4rife^ under. standard;:: 

conditions. Lane 1 R51/S17; Lane 2 R50/S17; Lane 3 R49/S17; Lane : 4 SI 7 alone. A 75% loss of activity- ■ 
was seen with R50 and R49. 

Figure 9. Loss of activity when bases 195-203 in the catalytic RNA sequence are removed. When the . 
rs underlined bases were removed and the adjacent bases ligated together, no catalytic activity was seen. 

Figure 10. Loss of activity when bases AAA at positions 203, 202 and 201 are changed to CGU re 
spectively. When the circled AAA bases were replaced by the underlying 5'-CGU-3' bases, no catalytic 
activity was seen. 

Figure 11 A. No effect no: activity when base A49 in the substrate is changed to a "GV The circled "A" 
20 base 49 in the substrate was changed to a "G" and no effect on activity was seen. The concentration of 
R51 was 0.016uM, [S17] = 0.4uM, and [Si 7(A->G)] = 0^uM. Reaction under standard: conditwn^ for 40 
min. Lane 1 :S17; Une 1S1T/R5t;:;^ i 

Rgure 11 B. No effect on activity was seen when base ;A49 in the : substrate was changed to a W U" (SIT'- 
fA^U)). The concentration of substrate RNA Sl7(A--> y) ; used was 6:1 2uM and : the : concentration of R51 
25 : was 0.0065uM. Reaction was for 60 minutes under standard conditions: The catalytic^ ; 
Figure 11C . No:effect on activity was seen when base A49 in the substrate 
(A~>C)). The concentration of substrate RNA S1 7(A~>C) |used: ;was ; 0,08uM ;: and ■ the concentration pf R51 
was 0.0065uM. Reaction was for 60 minutes under standard; condition s; The catalytic R^ 

Figure 1 2. Different target RNA sequences can be used as long as the base pairing : is maintained with 
30 the catalytic RNA. The C:G base pair predicted by the "hairpin -V model of the catalytic com 

)sTRSV, Rgure was changed to a G:C base pair (circled) and activity was main^ In this experiment 
the , usual substrate S1 7 was not used; instead a ; new : substrate ;w^ used ^ exact same; sequence 
• except: the fir^ two vector bases GC were eliminated|The resulting seq^ S15 
was gUGACAGUCCUGUUU. The substrate containing the C50-> G50 base - change was Si5(C->G) And the 
35 catalytic RNA with the G214->C214 base change was : R51 (G-> C). The reac tibhs were run uhd standard 

conditions for ;60. niih a^ V 
1 R51/S15; Lane 2 R51(G->C)/S15; Lane 3 S15; Lane 4 R51(G-:>C)/S1^ 

Figure 13. An RNA sequence found in tobacco mosaic virus (TMV) can be cleaved at a specific site. : 
The substrate RNA shown is that beginning with nucleotide #538 in the tobacco mosaic virus sequence: 
40 The catalytic RNA has been synthesized to : base pair with the TMV substrate RNA in the stem regions of 
the "hairpin" as shown by the .circled base pairs. Reaction was for 20 min under standard conditions with a 
catalytic ; RNA concentration of 3nM and a substrate concentration of 0.06uM. Lane 1 TMV substrate RNA : 
only; Lane 2 TMV catalytic RNA/TMV substrate RNA, 

Figures 14A-C . The sequences ; ; bf three - substrate : RNAs having sequences found in the messenger -: 
45 : RNA for chloramphenicol acetyl: transferase (CAT) are shown; They have 14-, 16- and 18-base long target ' : 
sites, and the length, of the 3' regions flanking the AGUC cleavage sequence -is extended in substrates (B) 
and (C) as compared to substrate: (A). Catalytic RNAs designed to base pair with the. substrate RNAs in 
both the 3 and 5 regions flanking the cleavage sequence AGUC were synthesized. The open boxed bases 
are those which are different from those in the native (-)sTRSV substrate RNA sequence shown inRg. 1. 
50 Figure 15. This figure shows the sequence of a substrate RNA having a sequence: found in the region of 
the HIV-1 viral RNA which, specifies the gag protein. A catalytic RNA was made whose . sequence was; 
designed so that the catalyst would base pair with ihe substrate RNA in both the 3' and 5' regions flanking , 
the cleavage sequence; Thei open boxed bases are those which are. different than those of the native ""■ <4 x 
)sTRSV sequence shown in Rgure 1. The catalytic RNA cleaved :the. substrate RNA at the arrow} ,;• ;•: . 
55 ■ Figure 16. Shown is the sequence of. a substrate RNA having the sequent 

the coding region for' the regulatory protein tat of the HIV-1 virus. :A catalytic RNA was made w ^wtef^M- 
; designed so that it would base pair wjth the substrate RNA in both the 3' and 5; Regions flanking : the UGLlb • : !#•••••;••' 
. catalytic cleavage sequence.: The open- boxes are bases . which are different; fronri those of the: native ;:: :(-vv '"^W 
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JsTRSV substrate sequence shown in Figure 1. Cleavage occurred at the arrow as shown. 

Figure 17. Shown is the sequence of a substrate RNA having four non-native bases (UUUU ) ; : added to 
the 3 end of the sequence of me native (-)sTRSV substrate RNA shown In Fig. 1 . A corresponding: catalytic; 
RNA was made whose sequence was designed so it would: base pair with the substrate in both the : 3::and 
5 ; 5 : regions, flanidng . the AG cleavage sequence^bieavag© rates : w 
/•: tibn arid^y^^ 

• : ^rylamid^ the radioactivity: and^p^ procedures ;to 

calculate: Km and kcat. :: . : 

Figure 18. Summary of the sequence requirement^ 
70 sequence isTequired for cleavage of the substrate as long as; the : short of bases in the regions v ; 

of the substrate flanking the cleavage sequence kre .substantially- base; paired with corresponding regions of • 
the RNA catalyst. The regions of base pairing are labeled 1 Helix i and Helix 2 in the figure. Also, the regions 
of base pairing predicted by the "hairpin" model for (-)sTRSV to exist in the "hairpin" portion of the catalyst 
are labeled Helices 3 and 4. 

is Figure 19, Confirmation of the existence of Helices; 3 and 4 predicted by the "hairpin" model for (- 
JsTRSV RNA. A G->C base mutation in base 35 {count bases from the 5' end of the catalytic RNA 
sequence shown) of the: (-)sTRSV catalytic RNA ; sequence shown in Fig. 1 resulted in an RNA with no 
catalytic activity;: (Lairi^s; 3 and • 4 i ( "mismatch ")). A dbu ble m utant , G35-> C; C27-> G had restored catalytic 
activity (Lanes 5 and:; 6:: These two base changes are in the Helix 4 region whose 

20 existence is . predicted^ by ; the vhairpih" m Also, a catalytic RNA having. a ; single base \ 

change at pbsition ;4^ (Lanes 9 and 10), while a double mutant, with j a second 

mutation Ci 7^^ and 12). these two base changes iare iri :tH6 ;: Heli^; 3 ^ 

region whose existence is predicted by the "hairpin" model. The control (Lanes 1 , 2. 7 and 8) is c|eavage 
the substrate;;RNA Sl 7 having the native.fjsTRSV sequence by catalytic RNA sequence R51 hawing :;;the 

25 native sequence. 

Figure 20. ■ The RNA sequence :bf a synthetic 
shown in Fiiijr^ 20> the same as that of the catalyst shown in Figure 1, but with addrtional 
% to form a loop at the'Srend of theHatalyst and to provide a substrate target sequence (i.e.; a cleavage 
sequence and upstream ind down which can bind to the substrate binding portion 

ao* the catalyst sequent ^Such -an RNA was prepared. When transcription was performed,- trie cassette :. 
autoc^alyticallycle^ec^ give the appropriate 3*F and s'F products. 

. Figure 21; : Shown are: two: ta^ that were made in the native (-)sTRSV catalytic sequence,: : 

shown in Figure 1 . The too bases changes were an "A" to "U" mutation at position 217 and: a "G" to : "C" 
mutation: A 

35 products obtained by reacting one of these catalysts or R51 (control) with substrate S17. Both base : , 
changes produced catalysts that were: Inactive when the catalysts were reacted with substrate S17 under 
standard conditions for 15 minutes. 

Rgure 22A shows a partial map of plasmid pHC-CAT containing the CAT gene linked to the "hairpin" 
autocatalytic"cassette of the invention. Also shown is the expected. RNA transcript of the illustrated region 
<to and the expected 5 fragment of the autocatalytic cleavage. 

Rgure 22B shows the result of Northern blot analysis of RNA produced by host cells transformed with 
pHC^CATT 

45 DESCFjiPTlONOF THE PRESENTLY PREFERRED EMBODIMENTS 

An RNA : catalyst: h^ which can be engineered to ; cut, : : 

with great precision^ target . RN As having^ a cleavage sequence. - In particular, the i nvention comprises certain 
synthetic RNA catalysts capable of cleaving an RNA substrate : which ; contains; the sequence 5 -Fi -CS-F2-3 . ■ 

so "Synthetic RNA catalyst," as used herein, rheans-a; catalyst: which is^ catalyst, 
although "synthetic catalysts" may: be truncated : or a|tered ; versions : qf ;:• naturallyroccuring catalysts. : 
"Synthetic catalysts " include catalysts synthesized in vitro and catalysts syhthesizBd: in vivo. In particular, : ; : ; : 
"synthetic catalysts" include catalysts produced hosts transformed by a vector: 'comprising a sequence 
/ coding for the cataly^ :• ; : :: . ^ ;;: : ^:^ : , 

55 • • RNA of any; ieng^-arid type may::.be us^ the: substrate as long as it ■ contains; the! ; £;rFr-CS-F2-3 
target sequencer^ 

at which the catalyst cleaves the substrate), GS is a short sequence: of ^ base pair with 

the RNA catalyst, and GS preferably has tfie - siequence 5 >N(5UC-3v wherein N - is 
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substrate is ; cleaved ;l by the catalyst between N and G to produce a fragment having an OH at the 5 end 
and a fragment havlng a 2-,3'^c at the 3' end. : 

■ CS is - flanked by two short base-; sequences F\ and F2 which do base pair with the RNA catalyst. Ft is 
preferably at Ieast : 3 Abases in length^ most preferably ;4 bases In length . is - also preferably at least 3 
5 bases; in length, most preferably 6 to 12 bases in length;' 

Catalysts according to the invention comprise a substrate binding portion and a "hairpin" 'portion.: The 
substrate binding portion of the catalyst has the sequence: 

3>i-Li>3-5'.- ; 

wherein, 

io F 3 is a sequence of bases selected so that Fa ; is substantially base paired with F2 : when the catalyst, is 
bound to. the 'substrate; : 

F>: is : a : sequence of bases selected : so,. -thai, F+ is sub^ntiaftyvbase : pai red with Fy when the catalyst is ; 

. bound to the ; substrate; . U : ^:t 
i : the sequences ^. W 

rs : sufficient binding of; the RNA substrate -to the RN A catalyst; so thai : cie^aget i of the substrate can take place; . 

and ;: 7:; : '-' : r.. '"T. •'• ■ ■ ■ • "\ • '.' - 'V'- : ---f7 : ^-V; ; .;'- 'J'^y : '. : --V:-- ■ ■ : : . 

Lt is a sequence of baises: selected so that Li does not base pair wim\fe is bound to the 

substrate. . 

As used herein, "substantially base paired" means: that greater than :65% of the bases of the two RNA 
20 sequences in question are base paired, and preferably greater than 75% of the bases are base paired. 
"Substantially unpaired" means that greater than 65% of the bases of the two sequences in question are 
not base paired, and preferably greater than 75% of the bases are not paired. 

F 3 is preferably at least 3 bases in length, most preferably from 6 to 12 bases in length. F* is 
preferably from 3 to 5 bases in length, most preferably 4 bases in length .Li is a short sequence of bases : 
25 , which- preferably has the sequence ■'5'-AGAA-3' when CS has the sequence 5-NGUC-3' . //Cv.:^ 

Tl^ portion of the catalyst whic^ configuration when th6 

: ; ;: sub^ate modeled in two dimensions for ;;minirhum energy folding. This is Shown in 

: Rgur^ RNA. The "hairpin" portion; (s net an the sense that- not all 

: bases' of the "hairpin " portion are base-paired: Ind^d, it i is preferablei : perhaps necessary, for the "hajrpih" 
30 portion to have ^ le^t one substantially unp assume a tertiary structure 

that allows for better, or optimal, catalytic activity. : r 

The "hairpin" portion of the catalyst preferably has the: sequence: 



35 



40 



r P 3^2- P 4- 3 ' 



L 2 



■'.::;; wherein ' ". ; ■ ■ : j (.>;• ' ■ & % ' ■ " ■ .'; ' ' i; \ ; ;WIy I; f--B ■ I , ■ ?' ; ■ :; : : ; : : )■ : . ':- ';: l 

Pi and Pd each is a sequence of bases; the sequ of Pi and P* being selected 

substantially base paired; :: 

Pi is covalently attached to F*; . 
45 Si arid S2 each is a sequence of bases, the sequences of Si and S2 being selected so that Si and S2 are 

substantially unpaired; 

P 2 and P 3 each is a sequence of bases; the sequences of P2 and P3 being selected so that P2 and P3 are 
substantially base paired; and 
L 2 is a sequence of unpaired bases. 
50 "Substantially base paired" and "substantially unpaired" have the same meanings as discussed above. 
P< and P4 each ;is preferably, from 3 to 6 bases in length, and most preferably Pi has the sequence 5 - 
: : ACCAG-3Vand Pi has the : sequehce 5 / -CUGGUA-3\ 

: 4 to 7 bases in lengthv and most preferably Si has the sequence 5 - 

AGAA-3':;ahd:iS2 has the; sequence 5 -AUAUUA^ 
56 P 2 arid Pa each is preferably from 3- to 6 bases in : length, and most preferably P2 has the sequence :5'? 
. : ACACAC-3' and P3 has the sequence 5'-GUGGU-3'. :- 

Finally, La is preferably at least 3. bases in length and most preferably has the sequence 5-GUU-3^ 
Further, 5'':S't-Pi-L2 preferably has the sequence s'-AGMACAGACGUU-s'. ; 



EP0360257A2 



The specific preferred sequences set forth above for Pi, Pal, Si, etc., are from the catalytic sequence; of 
(-)sTRSV RNA. 

The most preferred catalyst contains the sequence: 
5'-F 3 -Li-F^ACCAQAGAAACACACGUUGUGGUAUAUUACCUQGUA-3' ( , 
5 and active variants thereof, wherein F 3l F* and Li are as defined above. As used herein "active variants; 
means catalysts. which, although having substitutions, deletions and/or additions of bases as compared to 
the original sequence, are still capable of cleaving an RNA substrate. 

Another preferred catalyst according to the invention is ; : an autocatalytic catalyst containing the 
sequence:'^ 

: ^P 2 — s,— p,— f 4 — l,— f 3 ^ 



wherein, Ft, F 2 , F 3 , F*. Li, L 2 , Si, S2. Pi. P2, P3 and P* are as defined above. Thus, the ^l^ie;cohtain$ 
a substrate portion (5'-Fi-CS-F 2 -3') and a catalytic portion (5 -F3-L1 -F4-Pi-Si-p2-l2-P3-S2>P*-3-) coyatently 
linked together by U so as to produce a synthetic autocatalytic RNA catalyst U is a sequence of unpaired 
bases, and U preferably has the sequence-3^bcypp-5 . 

After being transcribed, this catalyst" will : ; spontaneously undergo an intramolecular autbc^ 
cleavage at CS. The effect of this intr^olecular cleavage is to autocatalytically terminate any RNA 
25 transcript in which the sequence is inserted: : 

The Invention further provides an engineered DNA molecule and a vector comprising a DNA sequence 
coding for an RNA catalyst of the invention. Also provided are host cells which have been transformed with : 
the vectors and which are capable of expressing the RNA substrate. Rnally, the invention provides a 
method of cleaving an RNA substrate which contains the sequence 5'-Fi-C$-F 2 -3 , the method comprising 
30 contacting the substrate witha synthetic RNA catalyst according to the invention. 

The Invention is further described below with particular reference to the catalytic properties of (-)sTRSV 
and the structure of its catalytic complex, but the invention is not limited thereto. In addition to the particular: 
: catalytic sequences shown and described activity to cleave an 
•x : RN A substrate can be read! ly found by applying the principles set forth in this specification. 
05 : For example, RNA sequences ^having the required ; structural features for cleaving an RNA substrate can 
be identified by: applying the Wisconsin RNA; Folding Prc^ram discussed above to (1) known sequences of 
: molecules haying; ^ in which the actual location of the 

catalytic i site is ^ pairing regions and lengths, 

and (3); random ly modified known catalytic - sequences, - while ; looking for known features of a catalytic 
40 molecule, such as the "hairpin" configuration of the catalytic complex: when mpSeledj in two dimensions. As 
a specific example, information regarding known autocatalytic cleavage sites canxb ; 
i : binding- ^sequences havjng^ 
adjacent a "hairpin" .porti<^. .SecoW^ such as the two substantially paired regions ; in : th? % • . 

: " 

45 be looked for, either by manual examination or with automated computer programs^ order : of decreasing | : ; : 
preference, the ^ 

that can base pair (form helices) with the regions of the substrate ; RN A molecule • f l^l<ing the : ; cleava^ 
sequence; (2) an unpaired (loop) region opposite the cleavage sequence of : the ; substrat^ 

; s^ 

50 unpaired region between two substantially - paired regions i n the "hai rpin " structure; and (5) : a loop : : 
connecting the two strands of a substantially "base paired region to complete the "hairpin" structure! : 
Standard techniques of in vitro RNA synthesis can then be used to prepare actual : molecules having the ; 
sequence that gives the"predicted two-dimensional computer-generated structure, for verification of activity 
and routine testing of variation to determine optimum sequence; 
55 . The catalysts of the present invention were developed using a "hairpin" model or motif of: RNA 
: catalysis,. According to : this model, the catalytic complex, when modeled in two dimensions . for minimum 
energy folding, assumes a "hairpin" configuration. This is shown in. Figure: 1 for (-jsTRSV |RNA. -The • 
: ■ •:. catalytic complex is a -'complex of - the minimum, or substantially the minimum,- sequence of the catalyst: , 
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necessary for activity and the minimum, or substantially the minimum, target sequence of the substrate. 
The "hairpin" configuration is hot an. absolute hairpin in the sense that not. all the bases that make up this 
"hairpin" configuration are base-paired, indeed/ there, preferably regions of unpaired bases and of 
substantially unpaired bases as discussed in detail elsewhere in'^e present application. ; 

s The "hairpin" model has proved very useful in designing new catalysts, but it is still only a computer 
model of the likely secondary structure of catalytic complexes involving catalysts according to thie pre^ 
invention. Also , it is ultimately the tertiary structure of RNA catalysts that determines their activity/ For the^ : 
reasons, all catalysts having the properties described herein are considered to come within the scope of the 
present invention, even If they do not form a "hairpin" configuration when complexed with the substrate and 

to even if they do not contain a "hairpin" portion. For instance, it may be possible to engineer a catalyst 
having the properties described herein, which does not have a loop L2. Such a catalyst would be considered 
to be fully equivalent to the catalysts described and claimed herein. 

As described in Example 1 , a catalytic complex was identified within the (-) strand of tobacco ringspot 
virus satellite RNA (sTRSV) when the : molecule was; folded using computer models to determine the v.;- 

is - minimum energy folding in two-dimensional space. iThe; (0 strand : is a 359 base long RNA of defined 
. sequence; and is knw^ 

; and Bruening, G: B\ (1986) Virology, 151: 1^-185;^ : Geriach) and Bruening^ G. (1 986) 

£i5*H££ r i^T 1 © (-) strand cleaves at a ^defined j^slte = (ApG)}intq having . ain OH : ; : 

at the 5 end and a 2 ,3' cyclic phosphate at the 3'- ^ present time, hovyeveir, little work had 

20 been done with the (-) strand to find the catalytic;;complex; and to determine the minimum cleavage 
sequences. because It does not fit the consensus "hammerhead" model. . 

In view of the above and the fact that the catalytic center would contain both the catalytic RNA 
sequence and the substrate (target) RNA sequence and by studying the results of Example 1, a 50 
nucleotide sequence between bases 175 and 224 was picked and a 14 nucleotide sequence between bases > 

25 40 and 53 was picked. Using methodologies found in published procedures, a catalytic RNA having a 
satellite RNA base sequence identical to the base: sequence in naturally occurring (-)sTRSV between bases 
175 and 224 was .transcribed: from chemjcally ; synthesized RNA polymerase as 

described |n Example 2. An RNA substrate having : a satellite: RNA base; sequence identical to the base 
sequence in naturally occunihg (-)sTRSy RNA between bases 40 -and 53 : ^ 

30; manner. When the newly, synthesized RNAs werei mixed together under appropriate conditions as described 
in Example 3, the catalytic RNA cleaved ;th^ describ^- in Example 4, the first RNA ;•:';:> 

catalyst fitting the "hairpin" motif was discove 50-base catalytic RNA and the 

14-base substrate RNA were modeled in two-dimensional space using computer modeling. 

The reaction of catalysts fitting the "hairpin": motif with an appropriate substrate proved to be an 

35 excellent catalytic reaction under physiological conditions; The reaction of the catalyst and substrate 
containing the sequences of (-)sTRSV shown in Figure 1 gave a Km of 0.03uM {see Example 5), which is 
20 times smaller than that of the Km for a catalyst fitting the "hammerhead" model (Uhlenbeck, O. C. . 
(1987) Nature 328 : 596-600) and accounts for its; ability to remove target RNA molecules to much lower , 
levels (20 times lower) than that of catalysts fitting the "hammerhead" Imddel. In addition, the kcat for the 

40 reaction is : 2.1 /m in at 37* C, which is at least ; 6 tim^ of : a catalyst haying the ; 

, "hammerhead";; co^ for a ; 

catalyst that fits the -"hairpin" model can be further improved by increasing the amount of base: pairing : : : 
between the substrate and catalyst (see Example: £1)^;:: :: ;\ 

Catalytic cleavage of the substrate RNA occurs over a broad pH range, preferably 5:5 to 8.0, and in the. 

45 presence of divalent ions such as Mg** t: ;e.g. from MgCI 2 . As would be expected 1 for a base catalyzed 
reaction, the rate of reaction increased with increasing pH. The reaction rate also increased with increasing 
concentration of divaient cations as shown in Example 8. 

The reaction takes place at physiological temperatures, preferably 16* C to 45* C, with a temperature . 
optimum at 37 *C as described. in Example 6. Temperatures above about 45' C. inactivate the reaction. Also, 

so the temperature optimum of the reaction is affected by the degree of base pairing between the substrate 
and catalyst (see Example 18). In particular, the length of the region of the catalyst that base pairs with the 
3 region of the: substrate flanking the cleavage sequence can be varied so that an engineered catalyst 
reacting at a desired temperature can; be obtained (see Example 1 8). ; 

; The 50 base catalytic RNA,: configured in the "hairpin" model in Fig. 1 Is the minimal- sequence, or 
55 substantially the minimal sequence, of (-)sTRSV RNA necessary to achieve catalysis. When the 3' terminal 
A (base 175) or AU (bases 175 and 176) were removed, catalytic activity was deceased by 75% as shown./ 
in Example 10. The 5' end of the molecule cannot be; substantially changed either without affecting catalytic 
activity because it is needed to provide base pairing with the substrate. It can be shortened by at most 
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about 3 bases (see Example 9). Experiments which rem^ 

RNA and llgated base 194 to base: 204 produced an Inactive catalytic RNA as described in: Example 11 ; : : :,. 
This shows that all or part bf ; the; ;| region between bases ;i 95-203; is essential pr^\B\^c activity; :; An 
additional mutagenesis e^erlm three A- bases : at positions 203; 2p2 

5 . arid 201 ' was dphe, When these baises were changed to 
resulting catalytic RNA was inactive. 

^Base changes can be made in the two base paired regions of the:"hairpiri rt portion of the catalytic: RNA, 
as long as substantial base pairing is maintained. This is shown in Example 22 where base changes that 
destroyed/base pairing in these two regions resulted in inactive catalysts. When a second base change was ; 
w made which restored base pairing, the catalytic activity was also restored (see Rgure : 19); 

Base changes cian also be made in the two . regions of the catalytic RNA that base pair with the 
substrate, as long as substantial base pairing with the substrate in . the regions flanking the.cleavage 
sequence is maintained and base pairing with the cleavage .sequence is. avoided as shown in BcamjDles 9 
and , 16-21. It is this feature that gives the RNA catalyst flexibility to be engineered to cut a specific target 
is RNA substrate; having a cleavage sequence such as NGUC.;,This is illustrated in Example 16 where the ; 
catalytic RNA wais engineered by changing base 21 4 from a Q to a C resulting in a catalytic RNA which 
failed to react with the : subs^^ RNA developed f rom : =• natural (-)sTRS V RNA. Activity : was restored, 
however, when the substrate RNA was changed so that Mt could base pair with the subject engineered .. 
catalytic RNA. Also see Examples 9 and 17-21. 
20 The (-)sTRSV catalytic RNA sequence has an 5-AGAA-3 r sequence opposite trie AGUC cleavage 
sequence of the substrate. As shown in Example 24, at ledst part of this AGM sequence seems to be 
invariant In particular, the base at position 217 in the AG AA loop was changed from a G to a C, and the 
base at position 216 was changed from A to U (see Rgure 21 ). As reported in Example-24, these changes 
destroyed the activity of the catalyst, i ■' 
25 ; : The target RNA substi^e of tha^ w 

: i loop which does not base pair to the ciatalytic RNA In two-dimensional space. A? ; shown; in Examples 1 3-15, 
: *A M in the the AGUC cleavage sequence lean be changed to any other base \ without effecting the ability of 
i . the RNA catalyst to cleave; the ;su^ shows that the; GUC - sequence; if conserved. Thus, 

• : ■ :;: the bnly sequence; requi 

30 The cleavage sequence has four flanking bases at its* 5' end and six at i^3' • end . which base pair with . 
the catalytic RNA. As described above, the bases in the \ flanking regions can be changed without affecti ng 
the ability of the catalytic RNA to cleave the substrate as long as sufficient base pairing with the catalyst is 
maintained in the flanking regions. This would be ex pected to work on longer RNA substrate sequences of 
any length as long as these criteria are met. Indeed, lengthening the 3 region of the substrate that base 
33 pairs with the catalyst has been found to provide a more efficient catalytic reaction. See Examples 18 and 
21. However, a smaller 10 base substrate having three flanking bases at its 5 end and three flanking bases 
at Its 3' end did not work as well as the 14-base substrate, as described in Example 9, 

Using the "hairpin" model as a guide, RNA catalysts can be engineered that base pair with an RNA 
substrate, and mediate a cleavage: in the cleavage sequence. In particular, catalytic: RNA can be engineered 
40 that • will cleave; any RNA substrate having a cleavage sequence, such as NGUC, and flanking regions with 
: which;^ 

"hairpin" motif. To do this, the bases flanking the cleavage sequence must be identified and the catalytic 
RNA engineered so that it does not pair in two-dimensional space with the cleavage sequence/ but does pair 
with adequate numbers of flanking bases upstream and downstream of the cleavage sequence. 

45 As shown in Examples 17-20, catalytic RNAs according to the invention can ^cleave specific viral and 
messenger RNA sequences. In Example 15, tobacco mosaic virus (TMV) RNA' containing the 5 coding ; 
region of the repiicase gene was targeted for specific cleavage by an appropriately engineered catalytic ■ 
RNA. The target sequence contained changes In 8 of the 14 bases of the substrate RNA having a base 
sequence found within the catalytic complex of (-)sTRSV RNA and was cleaved by the engineered RNA. . 

so . catalyst under conditions near physiological. Catalytic RNAs were also designed and synthesized using the; :, 
"hairpin" model: . as a guide which could cleave sequences: from messenger RNA; Codin^^ for; chloram- : 
phenicol acetyl transferase (Example 1 8) and from HI V-1 viral RNA coding for the gag and the tat proteins , 

; : : (Exiampies 19 and 20). ;• 

These examples demonstrate that the system can . be used to specif ically cleave • ah RNA sequence . for 

ss .which an appropriately engineered /catalytic RNA base : pairs : : at the ^ designated ; flying regions/ Suitable-.; ;. 
target RNA substrates include -viral, messenger; transfer, ribosomaf; nuclear, organella^ RNA,; ;, 

or any other natural RNA having, a cleavage sequence, as well as RNAs which have been engineered toi; ; 
contain an appropriate cleavage sequence. 
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Catalysts that fit the "hairpin" catalytic RNA modei are expected to be usefulin vivo in prokaryotes or. 
eukaryotes of plant or animal origin for controlling viral infections or for regulating the expression of specific 
genes. In this case, a cleavage sequence such as NGUC in the virus or complementary to NGUC in the 
gene would need to be Identified along with the flanking sequences immediately upstream and downstream 

5 of the cleavage sequence. Normally three to four bases on the 5 side of the cleavage sequence and 
enough bases in the order of 6 to 12 on the 3 side to provide adequate binding of the catalytic RNA and to 
provide reasonable certainty that the target RNA sequence is unique in the organism are all that is required. 
: : A catalytic RNA. is then .engineered. Which does not base pair with the cleavage sequence and which 
does base pair to the flanking: regions on the 5' and 3 ; side of the cleayaige sequence. A DNA template 

to corresppnding to this cat^ytic : RNA is thea s the art. . 

Such procedures include the phosphoramidite method. ■ . 

The invention also includes an . engineered DNA molecule and a vector comprising a sequence coding ". 
for the desired RNA catalyst The vector will have the DNA sequence coding for the desired catalytic RNA 
operatively linked to appropriate expression control sequences.: Methods of effecting this operative linkihg, 

75 either before or after the DNA coding for the catalyst is inserted : into the vector, are well known. Expression 
control sequences include promoters, activators, enhancers, operators, : stop signals, cap signals, 
polyadenyiatiorv signals, and other signals involved with the control of transcription/ . 

The vector must contain a promoter and a transcription termination signal, both operatively linked to the ^ 
synthetic DNA sequence, Le., the promoter is upstream of the synthetic: DNA sequence and the termiriatioh • : : 

20 signal is downstream from it. The promoter : may be any DNA sequence ' that shows transcriptional activity: In : 
the host cell and may: be derived from genes J encoding homologous: or- hiBterblogous proteins; and either : 
exfracellular or intracellular amylase, glyco^yl^es,5pr6teases; : : lip^ : 

glycolytic enzymes. Also, a promoter: recognized T7 RNA polymerase may : be used if the; host is also .: 
: engineered to contain the gene coding for T7 RNA polymerase. 

26 The prompter may contain upstream or downstream activator: and enhancer sequences. Ah : operator : : 
sequence may also be included downstream of the promoter, if desired, i : 

Expression control sequences suitable for use in the invention aria well known. They include thoise of 
the E coli lac system, the E. coli trp system, the TAC system and the TRC system; the major operator and 
promoter regions of bacteriophage lambda; the control region of filamentous single-stranded DNA phages; 

30 the expression control sequences of other bacteria; promoters derived from genes coding for Sac- 
charomyces cerevisiae TPi, ADH, PGK and alpha-factor; promoters derived from genes coding for. the 
Aspergillus oryzae TAKA amylase and A. niger glycoamylase,; neutral alpha--amylase and acid stable alpha- 
amylase; promoters derived from Rhizornucor miehei aspartic p'rpfe'in'ase and lipase; 

mouse mammary tumor promoter;: SV40 promoter; : the actin promoter; and other: sequences known to 

(35 control the expression of genes of prokaryotic cells, eukaryotjc celis, their viruses, or combinations thereof. : i •: 
■: : ' The v^tor must also contain one or more replication systems which aliow it to repiicate in the host • • 
cells. In particular, when the: host is a yeast, the vector should contain the yeast 2u; replication gehe^ 
3 and origin of replication; : ; 

The vector should further include one or more restriction enzyme sites for inserting the DNA template 

40 sequences into the vector, and preferably contains a DNA : sequence coding for a ; selectable or identifiable \ 
phenotypic trait which is manifested when the vector is present in the host cell ("a selection: marker"). 

Suitable vectors for use in the invention ::are well known. They :;in^ 
vectors, pUC (such as pUC8 and pUC4K), pBR (such as pBR322 and pBR32^ as pUR288), 

phage lambda; YEp (such as YEp24) plasmids,; and derivatives of these vectors. 

45 The resulting vector: having: the synthetic 

transform an appropriate host. This transformation : may be performed using methods well known 

Any of ; a large : number of available and well-known host cells may be : used in : the - practice of - this 
invention: The selection of a particular host is dejDendent upon a number of factors recognized by the: art. 
These include, for example, compatibility with the chosen expression vector, toxicity: to ; it of the catalytic : 

so RNA encoded.for by. the synthetic DNA sequence, rate of trans formation , exp ressioh characteristics, bio- 
safety and costs. A balance of these factors must be struck with the understanding that not all hosts may be - 
equally effective for the expression of a particular catalytic RNA, 

Within these general guidelines, useful hosts include bacteria (such as E. coli sp.) ( yeast (such as 
Saccharomyces sp.) and other fungi, insects, plants, animals (including human)7or"otner hosts known In the 

55 art. 

As an example of the general genetic engineering techniques that are possible, mammalian vectors can 
be used to deliver DNA coding for the catalytic RNAs of the invention to animal cells; these vectors should 
have a suitable DNA replication signal, such as from SV40, a promoter which may or may not be inducible, 
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such as the mouse mammary tumor promoter (which is induced by dexamethasone) or the honinducible 
SV40 promoter. A multiple cloning site is located after the promoter, and the ONA coding for the 
engineered catalytic RNA is inserted into this multiple cloning site using standard techniques. If necessary a . .• 
suitable terminator is Inserted, the resulting vector is then put into cells using standard techniques such as 

s direct injection into the nucleus," electroporation, or other well-known transformation techniques. Once the 
■ vector is in the cell, the; catalytic RNA is expressed directly' when noninducible promoters are used, or after 
addition of the inducer when inducible promoters are used.;^;^^ 

Similarly, ; plant vectors, such as the Ti piasmid or micro-Ti plasmids, can be used to deliver ON A 
coding for a desired catalytic RNA to plant cells. The Ti plasmids and micrchti piasmids may be used as; 

w such to transform plant protoplasts using known techniques or may ; be inserted into Agrobacterium ■ 
tumefaciens which is then used to transform plant tissue. Once the pias mid is in the cell, the catalytic RNA 
will be expressed. 

Once transformed, a host cell can express (transcribe) the RNA catalyst. When the DNA coding for the 
catalyst is transcribed, it produces catalytic RNA which will attack and cleave the target RNA sequence or 
is sequences for which it has: been designed, inactivating the RNA. If the RNA is necessary for the life cycle ;j 
of a virus, the virus will be eliminated and if trie RNA is the prodMCt'icrf a specific gene, the expression of 
that gene will thus be regulated. The catalytic RNA can be designed to work in prokaryotes and within the^ ^ 
nucleus -(without polyA tail) or in the cytoplasm of a euk^ 
for plants and animals. 

20 Another potential method of using the catalytic RN As of the -invention is to prepare stable synthetic ;•. -h : 
derivatives of RNA catalysts designed to bind arid cleave a spedflc target RNA and to deiiven^ 
catalysts -directly to the ceil or organism of choice. For example, istahdard : methods are availabie for making 
phoshporothioate derivatives of DNA which have been shown to be very stable in vivo arid to be able to 
bind to a specific DNA or RNA target in vivo (antisense method). A modification of these procedures can be 

25 used to prepare a catalytically active derivative of RNA catalysts prepared according to the invention. This 
would entail determining which ribonucleotide regions can be altered and then altering them with , deoxy, 
phophorothio, or other modifications which confer stability but do not destroy catalytic activity. This 
chemically modified catalytic RNA (which may or may not have any remaining RNA bonds) can then be 
injected or otherwise delivered to an organisrri to control viruses or gene expression. For instance, one of 

30 the catalytic RN As whose preparation is described in Examples 19-20 having specificity for sequences 
fo^ within the RNA; of the HIV-1 virus that causes AIDS could be chemically modified as described, 
encapsulated in a liposome coated with monoclonal antibody directed to the CD4 receptors found on cells 
• susceptible to HIV-1 , and injected into a host animal. : 

The "hairpin" catalytic RNA model . of the present invention; may also be of possible interest to 

35 molecular biologists in exploring the genetic blue-prints of plants and ^ 

sending randomly constructed DNA reverse transcripts of catalytic RNA into the DNA of the organism and 
waiting to see which gene or genes were inactivated. Other techniques could be applied to determine where 
those genes resided on the organism's chromosomes, thereby greatly accelerating gene mapping, 

Finally, a synthetic autocatalytic RNA catalyst has been developed. The synthesis of one such catalyst 

40 based on the (-)sTRSV RNA catalytic and substrate sequences is described in Example 23.; The use of: / 
such autocatalytic; catalysts allows for the efficient autocatalytic termination of any ;RNA transcript in which it 
; is inserted.'- <■ .-• . • ;• \. v \\- : ". ■ ■ >" • 

>''•" EXAMPLES " : : 



The following examples illustrate the invention. 

50 

Example 1 

The (-) sense sequence of satellite RNA from the budblight strain ;of tobacco ringspot virus as shown in 
65 . Fig. 2 was folded using the Wisconsin RNA folding program to identify the location of a possible catalytic 
domplex accounting for its ability to self cleave (University of : Wisconsin Genetics Computer Group, 
■ " Program FOLD 5/6/86) (Zucker, M. and Stiegler, P. (1981) Nucleic Acids Res. , 9: 133-148; Devereux. J., 
Haeberli, P and Smithies, 0. (1984) Nucleic Acids Res., 12: 387-395); Base: numbers correspond; to (+)- 
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sTRSV RNA (Buzayan, J. M., Geriach, W. L, Bruening, ■GKB:, : Keese; P. and Gould, A. ; R. (1 986). Virology ,: i 
151_: 186-199). With, this numbering scheme the s'-3 , direction of the. molecule ;is with decreasing base : • 
number. ■ ' " ■ •• •• '•• ; :. - : ■ ./. •>., "\ ■ ;, •: ;.' . :. " • ^j--;;^^ ••:•-.'. -. .-: 

,^' : i :^ 

s identified regions of ^pected' base; pairing which The folding: . 

also identified expected - non-base pairing loops at or near the; site of cleavage, v This model does not 
preclude higher order interactions occurring between non-adjacent portions of .the catalytic center? 



10 Example 2 



A 50 nucleotide sequence between bases 175 and 224 was picked and a 14 nucleotide sequence 
between bases 40 and 53 was. picked from the catalytic complex identified in Example ;1. A catalytic RNA ■ 

15 (R51) with the 50 base sequence; shown in Fig, 1 plus one additional vector base (G at the 5' end) and a 
substrate RNA (S17) with the 14 base sequence shown in Fig. 1 plus three vector bases (GCG at the 5 
■end) were transcribed using 77 RNA polymerase from synthetic DNA templates double stranded at the 
prompter site (Milligan, J. F.; Groebe, D . R; , Witherel I , G. VvVand Uhlenbeck, 0. C. (1987) Nuc. Acids Res M : : : 
15: 8783-8798). The synthetic DNA templates were made using ph^^ on an Applied 

20 Biosystems 381 A DNA synthesizer. The template ONAs were: catalytic RNA R5 1 : 3 ATT ATGCTG A- 
GTGATATCTTTGTCTCTTC AG TTGGTCTCTTTGTGTG CAAC ACC ATAT AATGG AC CAT 5' and substrate 
RNA S17: 3'aTTATGCTGAGTGATATCGCACTGTCAGGAGAAA 5'. 

Before transcription, a 18mer or 16mer DNA complement to the promoter for T7 RNA polymerase on 
the noncoding strand was hybridized by heating an equimolar . amount of template DNA with promoter 

25 complement to 65 ° C for 3 min. then placing in ice. A jtypical transcription reaction used 8ng/ul DNA : ; 
template, 0,5m M each : NTP, 2mM spermidine, 40mM : Tris pH ■;7:5/ : ;:4%^>poly^tfiViene: glycol 6,000; 6m M v 
MgCI 2 , 4mM NaCI, 10mM Triton X-idp;; 2.4: units/u^ CTP and; 3 ; ; 

units/ui T7 RNA polymerase (US Biochemical) and was run at 37" 90 min. : 

All in vitro transcribed RN As were isolated on 7M urea, ; 1 5-20% acrylamicte gels^ bands cut out and : : • 

30 isolated. All RNAs were sequenced using standard methods (Donis-Keller, H., Maxam; A. M. ^d Gilbert, W.^ !;i 
(1980) Nucleic Acids Res., 4:; 2527-2538); a method- which alsoigave the 5 ^ terminal base. 
at the 3 end were determined by ligation of the RNA to 5' P^ 2 pCp using T4 RNA ligase (BRL methods 
manual), nuclease T2 digestion, and separation of labelled bases by PEI thin layer chromatography in 0.3M : 
LiCI with appropriate standards. All RNA sequences corresponded to that expected from the DtiA template. 



Example 3 

40 : The catalytic: RNA R51 was added to. the substrate RNA $17 at a ratio of 1 :30 and the time course of v : 
; substrate RNA cleavage was studied .The reaction was carried out at 37* C in 1 2mM MgCb, 40mM Tris pH . 
7.5 and 2mM spermidine over a time period of 150 minand is summarized :as follows: 

45 S'gcgUGACAGUCCUGUUU 3' R51 gcgUGUCA + GUCCUGUUU 

(S17) (5'F) (3'F) 

The reaction products were separated on poly-acrylamide/urea gels ;by electrophoresis and bands cut 
50 out and counted in a liquid scintillation counter. The results are shown in Rg. 3. The time periods;: analyzed •;• 
. were: lane 1 ,30 sec; lane: 2, 5 min; lane 3, 15 min; lane 4, 30 min; lane 5; : 60 mih; lane 6. 90 mi hi ^d lane ; V 
7, 150 min. Beginning concentrations were as follows: R51- = o;b032uM and: ST7.=0.09uM; : • 

■ : As shown in Fig. 3,; the cleavage proceeds; to. virtual completion during the . course of the reaction .with.:; 
only 2% of the substrate remaining after 150 minutes; This; shows that, since there: was originally 30: times"-' 
55 as much substrate RNA as catalytic RNA, the RNA catalyst R51 of, necessity interacts with multiple ; . 
substrate molecules: during the course of the reaction. In addition, the amount of catalyst - remained the 
same and was unaltered, indicating that R51 is truly a catalytic entity. 
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Example 4 : v: 

After the RNA catalyst had been shown to be eff^tiye Inx described in ; 

s : Example;3, minimum energy of the 60 b^e sequence shown , complexed with the:1 4 base 
sequence^ was done using the computer meth<^s described in Exam complex forms a,: 

"hairpin" model or - motif as shown in Fig. 1 with the substrate RNA sequence /and the cataiytic R^A : 
sequence identified. The arrow Is at the site of cleavage. 

70 

Example 5 

Various concentrations of substrate S1 7 were used at constant concentration of catalyst R51 . and initial 
is : velocities : of each reaction determined, The reaction was carried out at 37* C in 12mM MgCb, 40mM Tris 
pH 7.5 andimM spermidine. Concentrations were as follows: R51 =0.0004uM and S17 3 0.125(jM (lane 1), 
0.0824uM (lane 2^ (lane 4), 6.021 uM (lane 5), 0.01 56uM (lane 6), 0.0078uM 

(lane 7) and 0.0039uM (lane 8). Each ; reaction was analyzed on poiyacrylamide : gels :as described in : 
Example 3 with the results shown in Fig. 4, An Eadie Hofstee plot of catalytic RNA R51 cleavage of 
20 substrate RNA $17 is. shown in Fig. 4. The reaction proceeded according to the predictions of the 
Michaelis-Menten equation indicating that it was a truly enzymatic reaction in thatj'as tha concentration of 
the substrate goes down, the velocity of the reaction goes down; From the velocity of -the reaction 'plotted 
as a function of substrate concentration, the Km calculated from the graph, was 0.03uM : and: the kcat 
(turnover number) was 2.1/min. This Km is 20 times smaller than the "hammerhead catalysts ^ 
25 Uhienbeck, 0. C. (1 987) Nature , 328: 596-600) indicating that lower concentrations of substrate can be 
removed. The kcat : is 6 times larger than that of catalysts fitting the "ham merhead " : model : at 37°b (see i • 
Uhienbeck, 0. C; (i 987) Nature, 328: 596-600), meaning that the- reaction- is 6 timiss faster. ■ : : : : 

. jo ;■ ... ■ • - ■ Example 6 . . 

The temperature dependence of the rate of cleavage of substrate RNA S17 by catalytic RNA R51 was ; 
tested over a temperature range and the reaction products analyzed on pblyacrylamide gels as described ih: : 
35 Example 3 with the results shown in Fig. 5. The reaction was carried out in 12mM MgCfe, 40m M Tris pH 
7.5 and 2m M spermidine at 45°C (lane 6), 41 *C (lane 5), 37 'C (lane 4), 33*G (lane 3), 27 *C (lane 2) and 
20 V C (lane 1). The concentrations used were: R51 - 0.001 6uM and S17 = 0.04uM. R51 was unlabeled. The :■■ 
velocities shown in the graph, in Figure 5 were calculated by the use of time points of 8 and 16 minutes. ; 
The separation patterns on the gel shown in Figure 5 is for the 16-minute time point. ; 
40 : dependence similar to that which would be expected of ia reaction 

involving base paired RNA molecules. The Arrhenius plot of the data shown in Fig . 5 gives a temperature 
optimum of 37* C for the reaction. Higher, temperatures reduce the reaction rate with a; very rapid rate \ 
reduction above 41 * C consistent with a melting out of the catalytic RNA structure. At 50 * C;no reaction was 
: detectable. The reaction rate at temperatures below 37 ' ■ C showed a linear reciprocal tern perature depen- 
ds dence consistent with a classical lowering of the energy of activation for the reaction. The; slope ;of the 
. in the Arrhenius plot gave an energy of activation of 1 9 Kcal/mqle which:; is close to that: found for catalysts 
fitting the "hammerhead" cleavage mechanism (1 3.1 Kcal/mole) (Uhienbeck, 6. C. (1 987} Nature , 328: 596- 
' 600). ' _ : ": . -.• \.;': : : ,::. : (,,V.:'' :/; = - " : 

■ ' Example 7 

The rate of cleavage of a constant concentration of substrate RNA S17 at varying concentrations 6f ; . ; 
55 catalytic RNA R51 was tested and the reaction products analyzed on polyacrylamlde gels as described in ; : ... 
■ Example 3 with the results shown in Fig, 6. The reaction was carried out at 37* C in;12mM MgCk, 40mM: . 
Tris pH 7.5 arid 2m M spermidine for 40 min (lane 1 and 2), 20 min (lane 3), 10 min (lane 4) and 5 min (lane 
5). The concentration of substrate was 0.175uM. The results are plotted in Fig. 6 and show that at saturating 
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concentrations of substrate the reaction rate is linear with increasing RNA catalyst concentrations as one 
would expect for a true catalytic reaction. 



Example 8 



The effect of Mg : ; : ; concentration: and; pH : ;: qh the rate - of cleavage of RNA substrate S17 by RNA 
catalyst R51 was determined as : shown in the' following : table:;:-:. ;: : .; 



70 



75 



20 



25; 



30 



35 



40 



45 



50 



55 



'fyigCl2(m 


^:v-^' : ;;ti/z(mih) . 




no detectable product:? 


■ A :: 


: . 1 38-2 !: 


■ : 6 


■ - 111 •:• 


8 


■ ' 115 


10 


- 88 


12 


81 


15 


74 


20 


62 


PH 


ti*(min) 


5.5 


330 


6.0 


:;M20:-: 


". 6.5 




7.0 


: ;48:}- 


7.5 




8.6 V 


•i" 38 



In the: Mg = studies, the substrate $17 concentration was 0.14uM and RNA catalyst; R51 concentration 
was 0.001 5uM. The reactions were at 37* ^ pH 7.5. In the pH studies, the substrate S1 7 

concentration : was 0^ was 0.001 4uM. the reactions were at 

37* C in 40m M iTris jf or p H 7.0, 7.5 , 8.0 and in 40m M Pipes for pH 5.5, 6.0 and 6.5^ 

The dependence:- of; the reaction r^e on Mg** and pH are virtually identical with those of catalysts • 
; fitting; the "harn^ 
basei Catalyzed ;;^r^ 

• increase in [GHp] betwe^jpH ) 6^; : ahcl ;;^ 0 ; : resu lied -'in; only a 3 foid increase in the reaction rate. ; 



: Example 9; 



A 10 base substrate (S10) was prepared by the methodology of Example 2. When the substrate was 
mixed with catalytic RNA R51, the. reaction is summarized ais follows: 



5'GACAGUCCUG 3' 
(S10) 



R51 



GACA 
(5 f F) 



GUCCUG 
(3*F> 



: The results of ^ :S i 6 ' corn parable> to those described With : S 17 in Example 5 

showed a Km = 0.06uiM and a kcat = 6.8/min; ; These results are shown in; F\& 7i and ' indicate ;that : Smaller: 
substrates can be used, but not as efficiently. 



Example 10 
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The 3'; terminal base of the catalyst shown in thb : "hairpin" model of the (-)sTRSV catalytic complex in . ■ 
Rg. 1 is at position '175! 'Two catalytic RNAs were prepared with exactiythe same sequence as R51, except ■ 
that one of them did not contain the 3' terminal "A" base (position 175) and the other one did not contain : 
the 3 terminal "UA" sequence (positions 176 and 175). Synthesis of these catalytic RNAs, designated R50 

5 and R49, respectively, was carried out as described in Example 2. R50 or R49 catalytic RNA was mixed 
with substrate RNA S1 7 under standard conditions of reaction and the products analyzed as described in 
Example 3, The results are given in Fig. 8 and show a 75% reduction in activity with either R50 or R49 as 
compared to the activity of catalytic RNA R51 having the 3 terminal "A" at position 175 and the 3' terminal 
"AU" at positions 175 and 176. 

id ' : ••. . • .• . - ' : ^r :., .. : '. . --.^v ■ ■ • • . ■ ■ . ■ . ■' ■ - 

Example 11 

is An RNA with the: same sequence as catalytic RNA RSI was prepared/except that bases 

omitted such that base 194 was in effect ligated to base 204. This RNA molecule was -prepared as :.■ 
described in Example 2 from an appropriate ONA template contain ing the complementary sequent 
this RNA was mixed with substrate RNA S1 7 as described in Example 3, no reaction occurred; These 
results show that major elements of Ihe "hairpin" ^ucture are required for RNA catalysis to occur ; and that ;- : 

20 removal of 9 bases (see Rg. 9) in the middle inac^ 

' ■ ' ■ i Example 12 : ; ■ ■ ■ : •••• V ■ 

25 ' "'- . . . V "• ' • : ; : '■. • ; ; ' : •'• ; • : ... 

An RNA with the same sequence as R51, except that the bases AAA at positions 203, 202 and 201 : k 
were changed to CGU, respectively, was prepared as described In Example 2 using an appropriate DNA :, 
. primer. When this potential RNA catalyst was mixed with substrate RNA S17 as described in Example 3, no ; 
reaction occurred. This shows that the integrity of one or ail of these bases (see Rg. 10) is required for 
30 catalytic activity. 

Example 13 

A substrate RNA with the: base : at position ; 49 in Rg. 1 changed from ah "A" to;a "G" was prepared as ; 
described in Example 2. When this subWate was reacted with the RNA catalyst R51 , no difference in rate 
of reaction was seen ' between this substrate and the substrate containing the "A" at position 49 (see Rg. ; 
11 A). This shows that alterations can occur in the "A" : base in the substrate RNA AGUC loop without . / 
40 affecting the ability of the catalytic RNA to cleave the substrate. 

Example 14 

Another substrate RNA identical to S1 7^ 
} preparedias described in :;E 
o!l2uM; was reacted with the catalytic: RNA R51 , at a concentration of 6.0085uM, under standard conditions 
as described in Example 3 for 60 minutes. The resultsare shown in Figure 11B where Lane i cont^ns the 
so products of the reaction of substrate RNA S17(A«>U)-with R51 catalytic RNA. No difference in the rate; of 
reaction was seen between S17(A-->U) substrate RNA and substrate $17 containing the "A" base at ■= 
position 49. 

55 ' Example 15 

Another substrate RNA identical to S17 but having "A" : replaced by "C" in the AGUC loop was 
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prepared as described in Example; 2 (designated T7(A^i> C) "); j;This 'substrate'. 1 RN A;- : at r a^concentratibriv of ■ --'4 
0.08GM, was! reacted; with the catalytic RNA R51 r at a cohcehtratipn of 0,0065uM, under standard; con^^ 
as described jh ; Exarnpie 3 for 60 minutes. The results are shown in -Rgbre contains the 

products of the! reaction of S17(A-->C), substrate: RNA with R51 catalytic 1 RNA.- : Again - no : : difference w^ : ^ : : 
5 - seen in the rate, of; reaction using S17(A->C) as compared to:^ ■ v / 

The combined results of Examples 1 3-1 5. show that the- base at position 49 ^ 
substrate may be any base. 

10 Example 16 



Base changes in the stem regions at the site of binding of the substrate RNA to the catalytic RNA in the 
"hairpin" configuration can be made as long as the base pairing is maintained. The "C" base at position 50 

/5 of the substrate was changed to a "G" using the methods in Example 2. When this substrate RNA was • . 
reacted with the catalytic RNA R51 , no cleavage of this substrate ; occurred. However, when a new^ catalytic 
RNA, containing ;^^ : 0 
methods in Example^; and ad full cle avage was • seen. The -effebt of -the base change : 

from "C" to "G". in the substrate was to eliminate the base pairing at this position- p^ "hairpin" ; ^ 

20 model since now a ;, 'G" would be across from a "G". However, when a "G" to "C n 'base change was made 
in the catalytic RNA, the bas$ pairing was restored, but in a reverse manner, and the integrity of the helices :. 
in the stem regions where the substrate. RNA binds to the catalytic RNA was thus conserved restoring ; : : 
catalytic activity (see Fig. 12). 

Example 17 • • 

An RNA sequence : found within ^ 
so methods described in ■ 
3 . A catalytic RNA : was synthesized with the sequence ^^5 

ACGUUGUGGUAUAUUACCUGGUA3' selected so that base: pairing between the , substrat^ and the catalytic " ■ 
RNA is maintained in the "hairpin" configuration (see Fig; 13). When these two RNAs were mixed under 
standard catalytic conditions as described in Example 3, the target was cleaved demonstrating that a 
35 sequence found within a native RNA can be cleaved. 



Example 18 

Three RrsIA sequences found within; the;; sequence of the messenger RNA for the enzyme chlorarh- : 
phenicol acetyl transferase (GAT) were sythesized using the methods described in : Example 2. The - 
synthesized substrate RNAs had the sequences (A) gUUUCAGUCAGUUGC;: (8) gUUUCAGUCAGUUGGUC; 
and (C) gggUUUCAGUCAGUUGCUC/U (se^ Fig! 14). 
45 Note that the latter two sequences are extensions of the first sequence and that additional bases have 
been added to the 3' end in the "region that the "hairpin" model predicts win" 'base "pair, with- the cataly tic 
RNA to form Helix 1 (see Figure 18). Also, substrate (C) had two additional G vector bases as compared to 
substrates (A) and (B). The site of cleavage after the A in the AGUC cleavage sequence (see the arrow in ; 
Figure 14) of the substrates corresponds to position 320 of- the CAT gene using the number system found 
so in the Tn9 sequence (Alton and Vapnak, : Nature, 282, 864 (1 979)). In Figure 1 4, the open boxed bases > are ^W' 
those which are different from those in the native (-)sTRSV substrate RNA sequence shown in Figure 1 . ■'.'■, 

• Catalytic RNAs corresponding to substrate RNAs (A), (B) and (C) were synthesized according .to the - : .r :: .'-' ' 
methods described ;;in Example, 2.: Their ^ sequences were -designed so - tha wouldxbase: pair ! with:: the: : ; . ..;■£' 
■ substrate RNAs M both the 3' : and 5' regions flanking ; ; the : AGUC cleavage sequence: In addition; the : : 6 
: : ■ 55 : catalvtic RNAs designed to react; with substrate RNAs (A) and (B) each ; contained the vector sequence G A 

at their 5- terminus; and the catalytic RNA designed to react with substrate RNA (C) contained the vector v v ! 
sequence GGG at -its- 5 . terminus. Otherwise, the catalytic RNAs had the same: sequence |as the\(-)sf RSV 
catalytic RNA sequence shown in Figure iv . - 
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The various , substrate and catalytic RNAs were reacted and the reaction products analyzed as 
described In Exampi^ were as described in Example 3, except for the following. 

For substrates (A) and (B); reaction conditions were: substrate RNA concentration = 0.05uM; catalytic RNA: 
. concentration " 0.005uM; reaction run at 18* C; and reaction time of 20 minutes. For substrate (C), the; 
5 reaction conditions were the same as for (A) : and (B), except that the reaction time was 40 minutes and •:• 
: . temperatures were 20 V C, 25 * C; 30*;C 37:- C.-/.;'. 

Cleavage of ail of the. substrate RNAs occurred: when 'they were mixed withthe correspond|ng:cataiytic. ; : 
RNAs as is; shown in Figure .14/ demohstratihg that ', catalytic.: RNAs ;• according tov the i nvention can { be . ^ 
synthesized which cleave: specific RNA sequences found; ^ 
10 demonstrates that cleavage of substoe RNA can be obtained e^^ length; of 

3 f end of the substrate which . base - pairs witii the catalyst ; ^ that ; forms : Helix 1 with the 

substrate according to the "hairpin^mo^ 

to 10 bases in substrate (C),: the reaction could then ^ 

having shorter sequences in this regidn, the reaction would proceed ; only at 1 6 ; G;: : 

. V Example 19 

20 An RNA substrate corresponding to part of the sequence of HIV-1, the virus which causes AIDS, was 
synthesized as described in Example 2. The sequence of this substrate RNA is shown in Figure 1^; 1Ti& , .::•' 
arrow:jn Figure 15 shows the cleavage site which corresponds to position 804 in the sequence of HIV-1 
V : strain SF2CG :. (Sanbhez-Pescador • et ; aL- Science , 227 , 484 (1 985). The sequence shown is found in the 

region; ^ The RNA substrate also had a GCG ^ . 

2S ^qu6nc^ which are different tha^ 

)sf RSV &bstir^ . 

• A catalytic RNA w^ synthesized :^ : to the methods of Example 2. its sequence was designed 
: ; so that it would: ba^e ^ both the 3' and 5' regions flanW^ 

sequence. In addition, the catalytic RNA contained the vector sequence GGG^a 
30 the cafcly tic RNA had; the same sequence as the (-)sTRSV' catalytic RNA sequence shown in Figure; il v 

The catalytic RNA and the substrate RNA were reacted and the reaction pnxJucts^ • ; 

descried in ; Ex^m^le- 3; The reaction conditions were as set forth in Bcample ;3, ; ^ 
temperature^ '^ere!^9i^^^^ # C, 30* 6 and 37* C. Also, the reaction wai run for ^ m 
for 37 ' C which was ruhvfor 15 minutes, and the substrate RNA concentration was 5QhM- arid the ^ v- • 
35 RN A conrahti^tibn : was i5nM^ i 

;;^;^ substrate at the expected , . 

CGUC cleavage sequence found in the loop between the two flan king : stern regions. Thus; ^specific; . 
, sequence found in the HIV-1 viral RNA that codes for the gag protein can be>^ 

occurred with an RNA substrate having a 16-batse target sequence^ S17 in the region 

40 at the 3end which base pairs with the catalyst (i^, the portion t^ with the: catalytic RNA 

according to the "hajrpih" model) - Also,: the reason occurred at^p ; 

Example 20 

45 • •; ;'• '.• • : : ' . ' . .v. ." ' ' ./.; ■ .' . ■ ' " . _ 

TV RNA: having a sequence found in the beginning of the coding region, for the regulatory 

protein ta^ as described in Example 2. The substrate sequence is shown in ; ; 

Rgure 16.; In addition to the HIV-derived sequence, i: the substrate RNA - had a ; GCG : 5 ; vector sequence. The 
so : open boxes are around bases which are different than/those of 

shown in Figure 1. /:.;;•/,/:•••.; 

A catalytic RNA -■hWng'-a sequence so that; it w^ 
flanking the UG UC loop (I .e;, the regions that forms Helices 1 and "2 with the catalytic RNA according to: the 
"hairpin" model) was also synthesized as described in Example 2. Otherwise, the catalytic RNAs had the 
55 same sequence as the (-)sTRSV catalytic RNA sequence shown in Figure 1, except that it had an additional 
■ 5 G vector base.' ■. •; .. '. , ;: .- \- ..[.}■■■].. , • •../ 

The; substrate RNA and catalytic RNA : were' ^ as 
. described in; : Example 3. Reaction conditions : ; were: : 37 * C; reaction times of ; zero arid 1 5 minutes; the 
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40 



concentration of substrate RNA was 100nM; and the concentration of catalytic RNA was 20nM. 

Cleavage occurred as expected between the "IT and the "G" in the UGUC cleavage sequence located 
between the two stem regions of the substrate. The cleavage site is indicated by the arrow in Figure 16. 
This, is .position 5366 in the sequence of HIV clone h9c (Muesirig et al. f Nature , 313, 450 (1985). These 
results again confirm, that a catalytic RNA designed according to the "hairpin " model can cleave a specific 
target sequence located in a naturally occurring RNA, In this case a key regulatory region (tat) of the HiV-1 
virus which causes AIDS. rfvM: V; 



10 Example 21 : 



Using the methods described in Example 2, a substrate RNA having; four riohrhative bases (UUUU) 
added to the 3' end of the sequence of the native (-)sTRSV substrate shown in Figure 1 and a : 
is corresponding catalytic RNA designed to base-pair with the substrate RNA in the 3 and is' reg of the 
... sub^^e flanking die cleavage sequence (I.e., the portions that form Helices 1 ;a^ catalyst 

. . ; according;: ^ 
: v;includ 

: • The catalyst had the samei sequence as the (-)sTRSV- 

20 catalytic i RNA sequ that it had four additional AAM i bases at the 5'ehd^ 

designed- to base; pair; w^ 

attheS^te^ 

" The substrate and catalytic RNAs were^re at standard conditions and the reaction prbduct^^ 
analyzed as described in Examples 3 and 5. Catalytic RNA concentration was ^ 
25 RNA concentration was 0.1 uM (Lane 1), O.OSuM (Lane 2), 0.033uM (Lane 3), 0.025uivi (Lane 4), b.0i6uM 
(Lane 5), 0.0l2uM (Lane 6), 0.006uM (Lane 7), and 0.003uM (Lane 8). The results are shown in Figure 17. 
Cleavage rates at each concentration of substrate were determined by cutting but the bands, counting 
radioactivity in a liquid scintillation counter and plotting the data using Michaeiis-Menton procedures to 
calculate Km and kcat (see Exarhple 5). 
30 : ■ the data show that- an extension of the length of the region of basef pairing-; between the substrate and 
; catal^:^ 

can : imp^ of the reaction. Cleavage of- ^ 

expected site,^^ as compared to the 

number of the ; reaction :w^ 

35 of substrate RNA per minute during the reaction. The kcat for- Si 7 cleavage by R5 i was;; 2:1 /mfhutei the 
Km of the reaction was the same as for S17 cleavage i by : R51 (30n!vty This shows that by optimizing: the 
length of the region of the catalyst that base pairs ;to the sub strate in the ;3^; region flanking the cleavage 
sequence (i.e., by optimizing the length of Helix 1 predicted by the ; "hairpin " model), the catalytic properties 
of the native reaction can be improved. 



Example 22 ;; 



45 ; A series: ^ 

base substitutions; a^coirn^ catalytic RNA sequence ; shown in Figure it; The 

substitutions, which are illustrated in : Figure 19, were the folldwi ng : (1 ) At nucleotide 35, : G ; was replaced by 
C (G35-->C) (count bases from the. $ end of the catalytic ; RNA sequence shown); (2) A double mutant, 
having the G at position 35 replaced by C and the;: C at position 27 replaced by G (G3^-> C; C27-->G); (3) 
50 At nucleotide 47, G was replaced by C (G47-->C); and (4) A doubie mutant having the G at position 47 
replaced by C and the C at position 17 replaced by G (G47->C; C17~>G). All catalytic RN As had an 
additional "G" vector base at their 5 end. 

The various catalytic RN As were reacted with substrate SI 7 and the reaction products were analyzed 
as described in Example 3. The results are shown in Figure 19 where Lanes 1, 3, 5, 7, 9 arid11 are at zero 
55 time, and Lanes 2, 4, 6,. 8, 10 and 12 are 15 minutes incubation under standard cleavage conditions (see 
: : . Example 3). : The concentration of catalytic RNA was 0.0065uM, and the concentration of substrate RNA was 
; 0.17ulvi. The. temperature was 37* C: The control, lanes- 1, 2, 7 and 8, was cleavage of -the native substrate 
RNA S1 7 by- catalytic RNA R51 which has the native; (-)sTRSV sequence (see Figure l): 
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As shown in Figure 19, the catalytic RNA G35->C had no catalytic activity (see Lanes 3 and 4 of 
Figure 19, where this catalytic RNA is designated "mis match" since the base substitution at position. at 
position 35 results in a loss of base pairing). The double. mutant catalytic RNA G35-> C; C27~>G showed 
. . restored catalytic activity (see Lanes 5 and >6 of* Rgure 19 where this catalytic RNA is designated 
5 "substitute b.p. n since the net/effect of the two base substitutions -is : to create a base pair, but a base pair; 
different than the one found in the native (-) sTRSV catalytic RNA). The» catalytic RNA G47~>C was also 
inactive (see Lanes 9 and 10 of Figure 19), while the double mutant, with the: second mutation C17«->G, ; 
showed restored activity (see Lanes 11 and 12 of Figure 19). 

The results of these mutagenesis studies provide .confirmation for the existence of Helices 3 and 4 (see 
io Figure 19) of the "hairpin" catalytic RNA model proposed herein. The results show that whenmutagensis 
was carried out which . caused a base, pair mismatch in the: region of proposed ba^se pairing,, the catalytic 
RNA was inactive. However, when a second mutation was carried out such, that the base: pairwas restored, 
. although the new base pair was different than; the: original base pair, , catalytic activity was restored. Such; 
: ■ ; : results are considered evidence for the existence , of, a helix (Fox ; and Woese,- Nature, 256, 505 (1 975).' • ,. 
76 ' These results also show that a variety of. catalytic RN As \having sequences different from the native ; (- 
)sTRSV sequence ^e catalytically active if : they are designed^ 

the "hairpin" model, such as maintenance of substantial base pairing in regions of preidirted helices. 
20 - Examples 23 



A synthetic "hairpin" autocatalytic cassette was prepared. The RNA sequence of the cassette is shown 
in Rgure 20. Several bases have been added at the 5 end of the catalyst as compared to the native (- 
25 )sTRSV sequence shown in Figure 1. The effect is to close the open end of the "hairpin" and to provide a 
substrate sequence (i.e., a cleavage sequence and upstream and downstream flanking regions) which can 
base pair with the subsTrate binding portion of the catalyst. 

: The cassette was" prepared by making an appropriate synthetic ON A template that would yield an RNA : 
; with the -sequence s^ the DN A template to transcribe RNA as described; in 

30 Example 2, with the following differences. After synthesizing the DNA template; was inserted into the ;; 

vector pTZ18R (US Biochemical) Into which a new multiple cloning site had- belen inserted. The new;/; . : 

multiple cloning site was! a construct containing sites; in: 5'-> 3' orden cleavage by the following. 

enzymes: BamHli Xhol, Apal, Sacll, Nael, Stul^ Kpnl, Mlul. The new multiple cloning site was Inserted into 

vector pTZ18R by cleaving the vector with Bam HI and Salt and then ligating the multiple cloning site to the : 
35 vector using T4 ligase. Located 5 to the inserted multiple cloning site; is the T7 RNA polymerase promoter. 

The vector containing the multiple cloning site was digested with - Mlul and Sail, and the DNA template 

coding for the autocatalytic cassette was ligated into the vector : qsing: T^ 

then linearized with Hindlll, and transcription carried out as described in Example 2. All restriction enzymes 
; and the T4 : ligase were obtained from IB1 and used according to manu^ 

40 : ••:;• After 1 being transcribed;; the! cassette spontaneously underwent ian lntramolecular autocatalytic cleavage 
at the expected site: to give the appropriate 3,'f and 5'F products : (see figure 20). Note that the effect of this 
is to autocatalyticaliy ; terminate a transcript; For example^;; is; in; itself a transc^pt which; has been 

terminated at its 3' end by the autocatalytic reaction. Further note that this termination only leaves five 
essential bas^s at the 3' end of this 5' F. These are UGACA which are boxed in Figure;! 20:- Thus, it is 

45 possible to very efficiently terminate transcription with this autocatalytic cassette arid leave only 
3' end to the transcript (Le., the s' fragment of the autocatalytic cleavage). 

Example 24 

so . 

A catalytic RNA was prepared as described in Example 2. Its. sequence was identical to that of R5T, 
except that the base at position 21 7 : in the AG AA loop was changed from: a G to a;C. The AGAA loop of the 
catalyst is opposite the cleavage sequencer of the substrate when the substrate and catalyst are complexed 
55 (see Rg. iy Anofter catalytic prepared as described in Example 2. Its sequence was also identical;; 

' to R51, except that the base at position 21 6 was changed from A to U. : . .■::■*>.. ' ' \. 

These, catalysts were reacted with the RNA substrate S17 under standard conditions as described in 
Example 3 at 37* C for 0 to 15 minutes. The control, was the reaction of substrate. S17 with catalyst R51. 
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The concentration of the three catalytic RNAs was 0.007uM, and the concentration of substrate RNA was 

o.7um. ; . . • ... " ; ; . ' . - : •;./, : ; , . . ;.: p ^ .-• . . " .•■ rr 

The reaction products were analyzed as described in.: Example 3, and the results are shown in Figure . 
21, where the first, lane in each gel is zero time and the secpn'd 

Figure 21 . changes in .either one of these two. bases (G217r->C: and ; A21 6-> 0) in the loop opposite the ^ 
cleavage sequence of the substrate destroyed • trie : activity of the catalyst. These results indicate that ■ these; 
two bases are likely invariant in the native (-)sTRSV catalytic sequence. 



Example 25 

Substrate RNAs identical to S 17, but with one of the bases at positions 46, 47 or 48 (i.e., bases GUC of 
the cleavage . sequence) : changed to a different base, were ! prepared as - described in Example 2r VVhen 
these : substrates 1 were reacted.:, with : the: RNA catalyst RSt under standard conditions for 60 . minutes as : 
. • described in • Example 3, , no cleavage of the substrate occurred . this shows : conservation of the . G UC . 
sequence of the cleavage sequence of (-)sTRSV RNA Trie results of these: experiments combined with the ; ■ 
results of Examples 13-15 show that the cleavage sequence of (-)sTRSV RNA is NGUC, where N is any : ; 
base.. . . 
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: . The vector prepared in Example : : 23 (hereinafter referred to ; ; as "pHC"); containing the "hairpin R ' 
autocatalytic cassette was tested; for. activity in; vivo as follows. First/ the CAT gene : was removed from 

. - .:: plasmid p MAMNEO-CAT (purchased from Clontech Inc.); with Smal and )0toI . > It was thehi ligated using f 4 % ■ ■:' 

,.. ligase into pHC which has been cut ' with Sma l and Xhdl to produce vector: pHC-CAT; :" 

The; original : vector used in :; these constructions (pTZ1 8R); ' see Example :; 23) ^contains an inducible i : • : : 
promoter (lacZ) ; arid , as a result of the steps -described in Example 23 arid immediately above in this i 
example, . trie CAT gene and the "hairpin" autocatalytic cassette were placed region (s^e; 

Figure 22A). Also, the CAT gene; and "hairpin " autocatalytic cassette were : linked so that the expected 
transcript would be as shown in Figure 22A (the "CAT-cassette RNA"). thus, the RNA sequence coded for 
by the "hairpin" autocatalytic cassette would be expected to serve as a chain terminator for the CAT 
transcript by cleaving at the indicated cleavage site {see Figure 22A). The expected 5 fragment of such a 
cleavage is also shown in Figure 22A ("Cleaved GAT-cassette RNA"). 

fvtext, pHC was transfected into Eschericia coli strain JM109 with calcium chloride using standard 
procedures as described In Maniatis et al. t Molecular Cloning (1983). Transformed cells containing pHC- 
CAT were grown in LB broth at 37°C and then induced with 1m M isopropyl-beta-D-thiogalactoside (IPTG) 
for one hour to allow expression of the lacZ region, including : the GAT; gene-cassette ; RNA transcript. : 

;■;: ;;; ; At the end of this: time, RNA was isolated by incubating the: ceN 50 mMTris; pH 8.0, 50rriM EDTA, 1 
rrig/mi lysozyme at" room . ternperature for ;16 ; rriinutes to lyse the: ceils. The ;ly safe was made to 0;5% ; : 
sodium dodecyl sulfate (SDS), and then centrifuged to remove cell debris.- Pheriql; was added to ^ 
supernatant at a ratio of ;1;1, and the supernatant was centrifuged to remove the precipitate. T^is proc^ureF;: 
was repeated, and the resulting : aqueous phase was treated with an . equal volume of isopropariol at -20 6 C 
for 20 minutes to precipitate the RNA. The precipitate was collected by ; ceritrifugation, drted and redissolved • 
in water. ' ■ ' . ' -. ; ■:;.-"• ■ ■' 

This isolated RNA was electrophoresed on 1 .2% agarose; gels containing formaldehyde as described in 
Current Protocols In : Molecular Biology (Greene 1 989). - After, electrophoresis, Northern blots were -carried out y 
using published methods, ( GeneScreen Plus, DuPont, July 1 985). Two. DN A probes jware^ used for; blotting 
the ; gels : The CAT probe was prepared by primer extension of the CAT gene Using the Klenow fragment of : : 
C)N A ; polymerase 1 1 and - dATP labelled with alpha: P 3 ?. The "hairpin " ^iautocatalytic cassette probe was : ^ 
prepared by kinasirig the DNA complement to the entire "hairpin" autocatalytic -cassette 
shown in Figure 20 with dATP labelled with gamma P 32 . The: expected positions of binding of these prbbes : (• 

, ;to the CAT-cassette RNA; and expected ,S fragrnent are shown in Figure 22A. 

; The results of the Northern "blot test are shown in Figure 22 B; As shown there, when the CAT; probe 
was used, the full length. CAT-cassette RNA transcript arid the expected 5 ; fragment werei detected oh the ■ 
gel, indicating that cleavage has taken place in vivo. 
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When the "hairpin" autocatalytic cassette probe wa^ 
. transcript was • detected (see Figure 22B). The fact th^ the 5^ fragment not hytitifiize: with this :probe : : was 
to: be expected r since most o^ 3 :: fragment:; 

after cleavage: The fact that the 3^ fragment Was not detected ■ by Northern blot analysis is aisb riot : 
s surprising. The : s' terminus of the 3' fragment would contain a s'-OH and not the 5 -ppp: which is ordinarily : 
seen in RNA transcripts. Thus, the. 3' fragment would be expected to be very labile in vivo and was likely 
degraded immediately after the autocatalytic cleavage. ; . 

The mobilities of the CAT-cassette RNA and the Cleaved CAT-cassette 5 fragment on the gel 
corresponded exactly to the predicted length of the transcripts. The standards run on the gel were E. coli 
w 16S and 23S RNA. 

As a control to determine if cleavage of the target RNA occurred during isolation of the RNA.intact,: 
uncleaved ; -hairpin" autocatalytic RNA separated on an acrylamide gel as described : in Example 23 was 
; . isolated from the gel using conditions similar to those described above , (he., no divalent cation, but in the 
, , presence of EDTA and SDS). On ly . intact, uncleaved RNA was obtained when: the; catalyst was re- 
16 electrophoresed. ; 

As various ^changes could be made in the above-described products and methods without departing 
from the scope of the invention, it \s intended that all matter contained in the above description or shown .in; 
accompanying drawings shall be interpreted as illustrative and shall not be interpreted in a limiting sense. 
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Claims 



1. A synthetic RNA catalyst capable of cleaving an RNA substrate which contains the sequence: : ; 

5-F1-CS-F2-3', 

25 wHereln^•. V•'V.;:; .;•'••'• 
CS:is a cleavage sequence; and 

Fi and; F2 each is: a sequence of bases flanking the cleavage sequence;; " 

the catalyst comprising a substrate binding portion and a "hairpin " portion, the substrate binding portion of 
" the catalyst having the sequence: ■■. 
30 3 # -F4-Li-F3-5', '". 
■ wherein, ': 

F 3 Is a sequence of bases selected so that Fa is substantially base paired with F 2 when the catalyst is 
bound to the substrate; 

F* is a sequence of bases selected so that F* is substantially base paired with F1 when the catalyst is 
35 bound to the substrate; 

the sequences of F3 and F* being selected so that each contains an adequate number of bases to achieve 
sufficient binding of the RNA substrate to the RNA catalyst so that cleavage of the substrate can take place; 

and •/•:;; • /• v . : " " : : • : ..-" v " . ■'■.;.[' ■ " ■ ■ 

: Li is a sequence of bases selected: so that Li does not base pair with GS when the catalyst is bound to the 

4o ' substrate. 

2. An RNA catalyst according to Claim 1, the "hairpin^ portion. of the catalyst having the sequence: : . , 

. ■■ ■ ■^s^§-3^ : ;-' 

45 ' 

50 wherein, ' ; : : \. ' ; . ;" : \^\-y r ^ ^ ^}-;,..\ \ :"' 

; Pi and P 4 each is a sequence of basest the sequences of Pi and P*: being selected ;so ttiat Pi and P* are 
'. substantially base paired; "-Sj-'y.-in 
Pi is covalently linked to ■ Ft ; 



Pi is - covalently - linked ■ to ■ Ft ; ; ■ 

Si and S2 each is a sequence of bases, the sequences of Sr ; ;and. S2; being selected so that. St and $2 are 
55 substantially unpaired; 

P2 and P3 each is a sequence of bases, the sequences of P2 and P3 being selected so that P2 and P3 ana 
substantially base paired; and 
L2 is a sequence of unpaired bases. 



23 



EP 0 360 257 A2 



3. An RNA catalyst according to Claim 1 or 2, which is capable of cleaving an RNA substrate in which 
CS has the sequence 5 -NGUC-3', wherein N is any base and the substrate is cleaved by the catalyst 
between N and G. . 

4. An RNA catalyst according to Claim 3 wherein Lj has the sequence .3'-AAGA-s'. 
s 5. An RNA catalyst according to Claim 1 or 2 wherein Fa is at least 3 bases in length and F* is from 3 

to 5 bases in length, and the catalyst cleaves a substrate wherein Fi arid F2 each is at least 3 bases in 

■ length.-- \ 

6. An RNA catalyst according to Claim 5 wherein F3 is from 6 to 12 bases in "length and F* is 4 bases 
in length, and the catalyst cleaves a substrate wherein Fi is 4 bases in length and F 2 is-from : 6 to 12 bases 

to in length. 

7. An RNA. catalyst according to Claim 2 wherein Pi and P* each is from 3 to 6 bases in length. 

8. An RNA catalyst according to Ciaim 7 wherein Pr has the sequence 5'-AC.CAG-3' and Pi has the : 
sequence 5-CUGGUA-3'. ■ • 

9. Ari RNA catalyst according to Claim 2 whierein Si and S2 each is from:4 to 7 bases in length, v . 

75 10. An RNA catalyst ^according to Claim 9 wherein Si has the sequence 5-AGAA-3' and Si has the!: ; .; 
.sequence 5 -AUAUUAC-3'. - 

1 1 . An RNA catalyst according to Claim 2 wherein; P2 iahd Pa each is; from 3 lehgth; ; f 

1 2. An RNA catalyst according to Claim 1 1 wherein Pz has the. sequence 5^ and P 3 has the 
sequence s'-GUGGUO'. - 

20 13. An RNA catalyst according to Claim 2 wherein L2 is at least 3 bases in length. 

14. An RNA catalyst according to Claim 13 wherein U has the sequence 5-GUU-3'. 

15. An RNA catalyst according to Claim 2 wherein 5'-Si-P2-L2 has the sequence 5'AGAAACACACGUU- 

3. 

16. An RNA catalyst according to Claim 1 or 2 which is capable of cleaving an RNA substrate selected 
25 from the group consisting of messenger RNA, transfer RNA, ribosomal RNA, viral RNA, nuclear RNA, 

organellar RNA and other cellular RNA. 

17. The catalyst of Claim 18 which is capable of cleaving an RNA substrate selected from the: group 
consisting of HIV-1 virus RNA and tobacco mosaic virus RNA. 

■ 18. An RNA catalyst according to Claim 16 which is capable of cleaving messenger RNA coding; for 
30 chloramphenicol acetyl transferase. 

19. An RNA catalyst according to Claim 2 containing the sequence: 



36 




wherein, : : '/.. X^X-r^X- ' 'X './XX- XS-XXX ' X/\... 

Fi , F 2 , F3, F4. Li , L 2 , Si , Szi P1TP2VPV and P* are as defined in; Claim si1 ah^ 2; and: " 
L3 \$ a sequence of unpaired ".. bases . that covalently links the catalyst portion of ; the molecule with the 
substrate portion to produce a synthetic autocatalytic RNA catalyst • 

20. An RNA catalyst according to Claim 19 wherein CS has the sequence 5 -NGUC-3', wherein N is 
any base, and the substrate is cleaved by the catalyst between ;N and G. 

21 . An RNA catalyst according to Claim 20 wherein L ( has the sequence s'-AAGA-s', 

22. An RNA catalyst according to Claim 21 wherein 5 -Pt -Si -P 2 -L2-P3-S2-Pa-3' has the sequence 
5'-ACCAGAGAAACACACGUUGUGGUAUAUUACCUGGUA-3'. 

23; An RNA catalyst according to Claim 22 wherein L 3 has the sequence 3-CCUCC-5'. 
24. A synthetic RNA catalyst, which is capable of cleaving an RNA substrate containing the sequence: 
■S^FtrCS-^-a,' . ■ 
:the .catalyst containing the sequence: ; : 
S'-Fa^i-Fi-ACCA •;;• 
55 , .and-; active- variants thereof v. i X.' 

.wherein/-- ■■ ' '; '-^X . • . '•' " ' ''. V '' ::: ; : -- '•• •.; •. 

CS is a cleavage sequence; 

Fi and F^ each is a sequence of bases flanking the cleavage sequence; 
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F3 is a sequence of bases selected so: that 1 F3 is substantially base paired with F 2 when the catalyst is 
bound to the substrate; 

F* is a sequence of bases selected so that F* is substantially base paired with F1 when the catalyst is 
bound to the substrate; 

5 the sequences of F 3 and F* being selected.so that each cpntajhs an adequate num .;. 
sufficient binding of the. RNA substrate to the RNA catalyst so that cleavage of the substrate can take, place;' ; : 
•: and' ."" v • .■■ ■ . . • • : - : : . • •: ' : :•:,; ' ' ; : ■ ■ : . /'• . ■ , . :• ..v .. V, : ;/ ', : - .; ' 
Li is a sequence of bases selected so that Li does not base pair with CS when the catalyst is bound to,the : : 
substrate. 

70 • . •'• 25. An RNA ; catalyst according to Claim 24 wherein F 3 ; is : a 

;5; bases in: iength/and the -catalyst cleaves a^s 
; •: • 26:: Ah : RNA catalyst ^according • to Clairn. 25 wherein 1 2 : ; bases In '"length : :and : Fi js; : 4k : '-; . 

, bases jaleh^th,'^ 

bases in length. 
is 27; An RNA catalyst according ;to 

has the sequence 5 -NGUC-3'; wherein N is any b^ is cleaved : by. the ^ 

• N and G. ' ; J ,. 

28. AN RNA catalyst according to Claim 27 wherein Li has the sequence 3 -AAGA-5 . ■ 

29. An RNA catalyst according to Claim 24 which is capable of cleaving an RNA substrate selected . 
20 from the group consisting of messenger RNA, transfer RNA, ribosomal RNA, viral RNA, nuclear RNA, 

organellar PNA and other cellular RNA. 

30. An RNA catalyst accordi ng : ; to; Claim 29 which is capable of cleaving an RNA substrate selecfed| ; , 
from the group consisting of Hi V-1: virus RNA and tobacco mosaic virus RNA. 

31. An RNA catalyst according to Claim 29 which is capable of i cleaving messenger RNA coding for 
- 25 -. - chloramphenicol acetyl transferase, i::.; 

32; An engineered DNA m^ 1 . 2, -1 ^or 24. 

33. A vector comprising - a DN to Claim 1 , 2, 19 or 24, . 
the DN A sequence; being pperatively linked 

34. The vector of Claim 33 which is capable of self replication in a h^^^ 

30 35. The vector of Claim 33 wherein the RNA catalyst encoded by the vector is capable of : 

RNA substrate selected from the group consisting of messenger: RNA, ^transfer; RiIvlA,: ribosomal RNA, viral; 

RNA, nuclear RNA, organellar RNA and other cellular RNA. 

36. The vector of : Claihi 35 wherein the RNA catalyst encoded by the vector is capable of cleaving an 

: RNA substrate selected ; f rom : thejgroup bonsisti ng of HiV^I virus RNA and tobacco mosaic virus RNA. 
35 37. The vector of encoded by the vector is capable of cleaving 

messenger RNA pbding; fqr c 

; ; ' ^ 33 and which is capable of expressing ; the ; 

A^met^od ;bf • cleaving ah; RNA isufe sequence: . . ' 

' 40.-5^;^ 

• wherein/ : - : . : . ./ ; .V.''; ; :;' : ■' . 
■ CS is a cleavage sequence; and 

Fr and F 2 each is a sequence of bases flanking the cleavage sequence; 

the method comprising contacting the substrate with a synthetic RNA catalyst comprising a substrate 
45 binding portion and a "hairpin" portion, the substrate binding portion of the catalyst having the sequence: 
S'FH.i-Fa-s', ■■ : " ' 

wherein; ; ;; ■■■■■■ ' ■■: ' 

F3 ; iis a sequence so that F 3 ; is substantially base paired with F2 when the catalyst is 

bound'to 'tha 

•so • Fi ;; is a sequence of bases "selected so that F4 Is substantially base paired with Fi when the catalyst is 

• bound to;^ /;'.. .:;';..;-'iv .'•••,;:•. 
.. the sequences 

sufficient binding of the RNA substrate to; the -RNA catalyst so thatcleavage of : tne; : substra^ 

and •. ' / ;-. : - .;" '■ ■■ '-\ '■ , •••• :; '•: ■■. '■, "■■ : ■ " : ' ' ••" ;■/ ■ ■ . : '■■} + •:<.;.' .' :•; / : ; : ': ; . / 

55 Lt is a sequence of bases selected so that LV does not base pair with CS when the catalyst is ;b6und: to. the 

substrate. . : - .[ •' .• : ;\--\ : :' : ' ".; ; :-; ; -. : ;:;:: : ; :; .;'•'•::;•: 

40. The method of Claim 39 wherein the "hairpin" portion of the catalyst has the sequence: 
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:• ■■■■■>:: ,x • ' -r^ - : \ , • ••••• - xx-x: •:• x-x x ; x-x- x- >:x:::::x:x::vx>:v, x x^x- 'v-: x-Xvx:: y x : x, :; : : : : : : : ,x x- . . •,: ,xx ; x - ,x : ; x : . -: :- 

-wherein-;.:^ ; . V: : ; :i % i 

Pi and : P4 each is a sequence of basesV the sequ arid Pi being selected so that P rand P* are 

substantially base paired; 
Pi is covalently linked to F*; 

Si and $2 each is a sequence of bases, the sequences of Si and S 2 being selected so that Si and Sa are 
substantially unpaired; 

Pz and P3 each is a sequence of bases, the sequences of P2 and P3 being selected so that P2 and Pa are 

substantially; basexpaired; and ; 

L 2 is a sequence; bases. , x .. . 

•41 /C; rnMbd iOf cleaving an RNA substrate containing the ; sequence: : 

comprising RNA catalyst containing the sequence: 

and active variants [thereof, wherein, 
CS is a cleavage sequence: • 

Fi and F2 each is a sequence of bases flanking the cleavage sequence; 

F3 is a sequence of bases selected so that F3 is substantially base paired with F 2 when the catalyst is 
bound to the substrate; 

F4 is a sequence of bases selected so that F4 is substantially base paired with F1 when the catalyst is 
bound- to the substrate; :; .: 

the sequences of F3 and F* being selected so;'to to achieve 

sufficient: binding 

ibases selected so t^ 

42; The ^ 41 wherein the cleavage occurs under physiological conditions. 

43. The method of Claim 42 wherein the cleavage occurs in vivo i n a host cell which has been 
transformed with a vector comprising a ON A sequence coding for the RNA catalyst the DNA sequence 
being operatively linked to expression control sequences. 
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This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 
E^FADED TEXT OR DRAWING 
ETbLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCED) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



